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( ( )MPOSlT10NS AND METHODS FOR THE THEFL\PY AND DIAGNOSIS OF 

PROSTATE CANCER 

TIXlfNlCAL FIELD OF THE INVENTION 

The present invention relates generally to therapy and diagnosis of 
cancer, such as prostate cancer. The invention is more specifically related to 
pol\ fx'ptiJcs. comprising at least a portion of a prostate-specific protein, and to 
pu]\ nucleotides encoding such polypeptides. Such polypeptides and polynucleotides 
JO arc uscJul in plurmaceuticaJ compositions, e.g., vaccines, and other compositions for 
the di^ignosis and ireannent of prostate cancer. 

BACKGROl 'ND OF THE INVENTION 

Cancer is a significant health problem throughout the world. Although 
Cancer is a significant health problem throughout the world. Although advances have 

! 5 been made in detection and therapy of cancer, no vaccine or other universally successful 
method for prex ention or treatment is currently available. Current therapies, which are 
generally based on a combination of chemotherapy or surgery and radiation, continue to 
prove inadequate in many patients. 

Prostate cancer is the most common form of cancer among males, with 

20 an estimated incidence of 30% in men over the age of 50. Overwhelming clinical 
evidence shows that human prostate cancer has the propensit>' to metastasize to bone, 
and the disease appears to progress inevitably from androgen dependent to androgen 
refractors status, leading to increased patient mortality. This prevalent disease is 
currently the second leading cause of cancer death among men in the U.S. 

2> In spite of considerable reseaich into therapies for the disease, prostate 

cancer remains difficult to treat. Commonly, treatment is based on surgery and/or 
radiation tlierap} . but these methods are ineffective in a significant percentage of cases. 
Two previously identified prostate specific proteins - prostate specific antigen (PSA) 
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and prostatic acid phosphatase (PAP) - have limited therapeutic and diagnostic 
potential. For example, PSA levels do not always correlate well with the presence of 
prostate cancer, being positive in a percentage of non-prostate cancer cases, including 
benign prostatic hyperplasia (BPH). Furthermore, PSA measurements correlate with 
5 prostate volume, and do not indicate the level of metastasis. 

In spite of considerable research into therapies for these and other 
cancers, prostate cancer remains difficult to diagnose and treat effectively. Accordingly, 
there is a need in the art for improved methods for detecting and treating such cancers. 
I The present invention fiilfills these needs and further provides other related advantages. 



1 0 SUMMARY OF THE INVENTION 

In one aspect, the present invention provides polynucleotide 
compositions comprising a sequence selected from the group consisting of: 

(a) sequences provided in SEQ ID NO: 1-111, 115-171, 173-175, 
177, 179-305, 307-315, 326, 328, 330, 332-335, 340-375, 381, 382 and 384-476, 524, 

15 526, 530, 531, 533, 535, 536, 552, 569-572, 587, 591, 593-606, 618-626, 630, 631, 634, 
636, 639-655, 674, 680, 681, 711, 713, 716, 720-722, 735, 737-739, 751, 753, 764, 765, 
773-776 and 786-788; 

(b) complements of the sequences provided in SEQ ID NO: 1-111, 
115-171, 173-175, 177, 179-305, 307-315, 326, 328, 330, 332-335, 340-375, 381, 382 

20 and 384-476, 524, 526, 530, 531, 533, 535, 536, 552, 569-572, 587, 591, 593-606, 618- 
626, 630, 631, 634, 636, 639-655, 674, 680, 681, 71 1, 713, 716, 720-722, 735, 737-739, 
751, 753, 764, 765, 773-776 and 786-788; 

(c) sequences consisting of at least 20 contiguous residues of a 
sequence provided in SEQ ID NO: 1-111, 115-171, 173-175, 177, 179-305, 307-315, 

25 326, 328, 330, 332-335, 340-375, 381, 382 and 384-476, 524, 526, 530, 531, 533, 535, 
536, 552, 569-572, 587, 591, 593-606, 618-626, 630, 631, 634, 636, 639-655, 674, 680, 
681, 711, 713, 716, 720-722, 735, 737-739, 751, 753, 764, 765, 773-776and 786-788; 

(d) sequences that hybridize to a sequence provided in SEQ ID NO: 
1-111, 115-171, 173-175, 177, 179-305, 307-315, 326, 328, 330, 332-335, 340-375, 
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381. 382 and 384-476. 524. 526. 530, 531, 533, 535, 536, 552, 569-572, 587. 591, 593- 
606, 618-626, 630, 631, 634, 636, 639-655, 674, 680, 68K 711, 713, 716, 720-722, 735, 
737-739, 751. 753, 764, 765, 773-776 and 786-788, under moderately stringent 
conditions; 

(e) sequences having at least 75% identity to a sequence of SEQ ID 
NO: 1-111, 115-171, 173-175, 177, 179-305, 307-315, 326, 328, 330, 332-335, 340- 
375, 381, 382 and 384-476, 524. 526, 530, 531, 533, 535, 536, 552, 569-572, 587, 591, 
593-606, 618-626, 630, 631, 634, 636, 639-655, 674, 680, 681, 711, 713, 716, 720-722, 
735, 737-739, 751, 753, 764, 765, 773-776 and 786-788; 

(0 sequences having at least 90% identity to a sequence of SEQ ID 
NO: 1-111, 115-171, 173-175, 177, 179-305, 307-315, 326, 328, 330, 332-335, 340- 
375, 381, 382 and 384-476, 524, 526, 530, 531, 533, 535, 536, 552, 569-572, 587, 591, 
593-606, 618-626, 630, 631, 634, 636, 639-655, 674, 680, 681, 71 1, 713, 716, 720-722, 
735, 737-739, 751, 753, 764, 765, 773-776 and 786-788; and 
15 (g) degenerate variants of a sequence provided in SEQ ID NO: 1- 

111, 115-171, 173-175, 177, 179-305, 307-315, 326, 328, 330, 332-335, 340-375,381, 
382 and 384-476, 524, 526, 530, 531, 533, 535, 536, 552, 569-572, 587, 591, 593-606, 
618-626, 630, 631, 634, 636, 639-655, 674, 680, 681, 711, 713, 716, 720-722, 735, 737- 
739, 751, 753, 764, 765, 773-776 and 786-788. 
20 In one preferred embodiment, the polynucleotide compositions of the 

invention are expressed in at least about 20%, more preferably in at least about 30%), 
and most preferably in at least about 50% of prostate tissue samples tested, at a level 
that is at least about 2-fold, preferably at least about 5-fold, and most preferably at least 
about 10-fold higher than that for other normal tissues. 
25 The present invention, in another aspect, provides polypeptide 

compositions comprising an amino acid sequence that is encoded by a polynucleotide 
sequence described above. 

The present invention further provides polypeptide compositions 
comprising an amino acid sequence selected from the group consisting of sequences 
30 recited in SEQ ID NO: 112^114, 172, 176, 178. 327, 329, 331, 336, 339, 376-380, 383, 
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477-483, 496. 504, 505, 519, 520, 522, 525, 527, 532, 534, 537-551, 553-568, 573-586, 
588-590, 592, 627-629, 632, 633, 635, 637, 638, 656-671, 675, 683, 684, 710, 712, 714, 
715, 717-719, 723-734, 736, 740-750, 752, 754, 755, 766-772, 777-785 and 789-791. 

In certain preferred embodiments, the polypeptides and/or 
5 polynucleotides of the present invention are immunogenic, i.e.^ they ai*e capable of 
eliciting an irmnune response, particularly a humoral and/or cellular immune response, 
as further described herein. 

The present invention further provides fragments, variants and/or 
deri\aiiN cs of the disclosed polypeptide and/or polynucleotide sequences, wherein the 

10 fragments, x ariants and/or derivatives preferably have a level of immunogenic activity 
of at least about 50*! o, preferably at least about 70% and more preferably at least about 
90% of the ic\'el of immunogenic activity of a polypeptide sequence set forth in SEQ ID 
NO: 1 12-1 14, 172, 176, 178, 327, 329, 331, 336, 339, 376-380, 383, 477-483, 496, 504, 
505, 519 520, 522, 525, 527, 532, 534, 537-551, 553-568, 573-586, 588-590, 592, 627- 

15 629, 632, 633, 635, 637, 638, 656-671, 675, 683, 684, 710, 712, 714, 715, 717-719, 
723-734, 736, 740-750, 752, 754, 755, 766-772, 777-785 or 789-791, or a polypeptide 
sequence encoded by a polynucleotide sequence set forth in SEQ ID NO: 1-111, 115- 
171, 173-175, 177, 179-305, 307-315, 326, 328, 330, 332-335, 340-375, 381, 382 and 
384-476, 524, 526, 530, 531, 533, 535, 536, 552, 569-572, 587, 591, 593-606, 618-626, 

20 630, 631, 634, 636, 639-655, 674, 680, 681, 711, 713, 716, 720-722, 735, 737-739, 751, 
753, 764, 765, 773-776 and 786-788. 

The present invention further provides polynucleotides that encode a 
polypeptide described above, expression vectors comprising such polynucleotides and 
host cells transfoiTned or transfected with such expression vectors. 

25 Within other aspects, the present invention provides pharmaceutical 

compositions comprising a polypeptide or polynucleotide as described above and a 
physiologically acceptable carrier. 

Within a related aspect of the present invention, pharmaceutical 
composhions, e.g., vaccine compositions, are provided for prophylactic or therapeutic 

30 applications. Such compositions generally comprise an immunogenic polypeptide or 
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polynucleotide of the invention and an immunostimulant, such as ^in adjuvant, together 
with a physiologically acceptable carrier. 

The present invention further provides pharmaceutical compositions that 
comprise: (a) an antibody or antigen-binding fragment thereof that specifically binds to 
a polypeptide of the present invention, or a fragment thereof; and (b) a ph\ siological]y 
acceptable carrier. 

Within further aspects, the present invention provides pharmaceutical 
compositions comprising: (a) an antigen presenting cell that expresses a polypeptide as 
described above and (b) a pharmaceutically acceptable carrier or excipient. Illustrative 
antigen presenting cells include dendritic cells, macrophages, monocytes, fibroblasts 
and B cells. 

Within related aspects, pharmaceutical compositions are provided that 
comprise: (a) an antigen presenting cell that expresses a polypeptide as described above 
cmd (b) an immunostimulant. 

The present invention further provides, in other aspects, fusion proteins 
that comprise at least one polypeptide as described above, as well as polynucleotides 
encoding such fusion proteins, typically in the form of pharmaceutical compositions, 
e vaccine compositions, comprising a physiologically acceptable carrier and/or an 
immunostimulant. The fusions proteins may comprise multiple immunogenic 
polypeptides or portions/variants thereof, as described herein, and may further comprise 
one or inore polypeptide segments for facilitating and/or enhancing the expression, 
purification and/^or immunogenicity of the polypeptide(s). 

Within further aspects, the present invention provides methods for 
stimulating an immune response in a patient, preferably a T cell response in a human 
patient, comprising administering a pharmaceutical composition described herein. The 
patient may be afflicted with prostate cancer, in which case the methods provide 
treatment for the disease, or a patient considered to be at risk for such a disease may be 
treated prophylactically. 

Within further aspects, the present invention provides methods for 
inhibiting the development of a cancer in a patient, comprising administering to a 
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patient a phaimaceutical composition as recited above. The patient may be afflicted 
with prostate cancer, in which case the methods provide treatment for the disease, or a 
patient considered to be at risk for such a disease may be treated prophylactically. 

The present invention fnrther provides, within other aspects, methods for 
5 removing tmiior cells from a biological sample, comprising contacting a biological 
sample with T cells that specifically react with a polypeptide of the present invention, 
wherein the step of contacting is performed under conditions and for a time sufficient to 
permit the removal of cells expressing the polypeptide from the sample. 

Within related aspects, methods are provided for inhibiting the 
1 0 development of a cancer in a patient, comprising administering to a patient a biological 
sample treated as described above. 

Methods are further provided, within other aspects, for stimulating 
and/or expanding T cells specific for a polypeptide of the present invention, comprising 
contacting T cells with one or more of: (i) a polypeptide as described above; (ii) a 
15 polynucleotide encoding such a polypeptide; and (iii) an antigen presenting cell that 
expresses such a polypeptide; under conditions and for a time sufficient to permit the 
stimulation and/or expansion of T cells. Isolated T cell populations comprising T cells 
prepared as described above are also provided. 

Within further aspects, the present invention provides methods for 
20 inhibiting the development of a cancer in a patient, comprising administering to a 
patient an effective amount of a T cell population as described above. 

The present invention further provides methods for inhibiting the 
development of a cancer in a patient, comprising the steps of: (a) incubating CD4"^ 
and/or CD8+ T cells isolated from a patient with one or more of: (i) a polypeptide 
25 comprising at least an immunogenic portion of polypeptide disclosed herein; (ii) a 
polynucleotide encoding such a polypeptide; and (iii) an antigen-presenting cell that 
expressed such a polypeptide; and (b) administering to the patient an effective amount 
of the proliferated T cells, thereby inhibiting the development of a cancer in the patient. 
Proliferated cells may, but need not, be cloned prior to administration to the patient. 
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Within further aspects, the present invention provides methods for 
determining the presence or absence of a cancer, preferably a prostate cancer, in a 
patient comprising: (a) contacting a biological sample obtained from a patient with a 
binding agent that binds to a polypeptide as recited above; (b) detecting in the sample an 
5 amount of polypeptide that binds to the binding agent; and (c) comparing the amount of 
polypeptide with a predetermined cut-off value, and therefrom determining the presence 
or absence of a cancer in the patient. Within preferred embodiments, the binding agent 
is an antibody, more preferably a monoclonal antibody. 

The present invention also provides, within other aspects, methods for 
10 monitoring the progression of a cancer in a patient. Such methods comprise the steps 
of: (a) contacting a biological sample obtained from a patient at a first point in time 
with a binding agent that binds to a polypeptide as recited above; (b) detecting in the 
sample an amount of polypeptide that binds to the binding agent; (c) repeating steps (a) 
and (b) using a biological sample obtained from the patient at a subsequent point in 
15 time; and (d) comparing the amount of polypeptide detected in step (c) with the amount 
detected in step (b), and therefrom monitoring the progression of the cancer in the 
patient. 

The present invention further provides, within other aspects, methods for 
determining the presence or absence of a cancer m a patient, comprising the steps of: (a) 

20 contacting a biological sample obtained from a patient with an oligonucleotide that 
hybridizes to a polynucleotide of the present invention; (b) detecting in the sample a 
level of a polynucleotide, preferably mRNA, that hybridizes to the oligonucleotide; and 
(c) comparing the level of polynucleotide that hybridizes to the oligonucleotide with a 
predetermined cut-off value, and therefrom detennining the presence or absence of a 

25 cancer in the patient. Within certain embodiments, the amount of mRNA is detected 
via polymerase chain reaction using, for example, at least one oligonucleotide primer 
that hybridizes to a polynucleotide of the present invention, or a complement of such a 
polynucleotide. Within other embodiments, the amount of mRNA is detected using a 
hybridization technique, employing an oligonucleotide probe that hybridizes to an 

30 inventive polynucleotide, or a complement of such a polynucleotide. 
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In related aspects, methods are provided for monitoring the progression 
of a cancer in a patient, comprising the steps of: (a) contacting a biological sample 
obtained from a patient with an oligonucleotide that hybridizes to a polynucleotide of 
the present invention; (b) detecting in the sample an amount of a polynucleotide that 
5 hybridizes to the oligonucleotide; (c) repeating steps (a) and (b) using a biological 
sample obtained from the patient at a subsequent point in time; and (d) comparing the 
amount of polynucleotide detected in step (c) with the amount detected in step (b), and 
tlierefrom monitoring the progression of the cancer in the patient. 

Within further aspects, the present invention provides antibodies, such as 
10 monoclonal antibodies, that bind to a polypeptide as described above, as well as 
diagnostic kits comprising such antibodies. Diagnostic kits comprising one or more 
oligonucleotide probes or primers as described above are also provided. 

These and other aspects of the present invention will become apparent 
upon reference to the following detailed description and attached drawings. All 
15 references disclosed herein are hereby incorporated by reference in their entirety as if 
each was incorporated individually. 

BRIEF DESCRIPTION OF THE DRAWINGS AND SEQUENCE IDENTIFIERS 

Figure 1 illustrates the ability of T cells to kill fibroblasts expressing the 
representative prostate-specific polypeptide P502S, as compared to control fibroblasts. 
20 The percentage lysis is shown as a series of effector;target ratios, as indicated. 

Figures 2A and 2B illustrate the ability of T cells to recognize cells 
expressing the representative prostate-specific polypeptide P502S. In each case, the 
number of y-interferon spots is shown for different numbers of responders. In Figure 
2A, data is presented for fibroblasts pulsed with the P2S-12 pepfide, as compared to 
25 fibroblasts pulsed with a control E75 peptide. In Figure 2B, data is presented for 
fibroblasts expressing P502S, as compared to fibroblasts expressing REK-2/neu. 

Figure 3 represents a peptide competition binding assay showing that the 
P1S#10 pepfide, derived from P501S, binds HLA-A2. Pepfide P1S#10 inhibits HLA- 
A2 restricted presentation of fluM58 peptide to CTL clone D150M58 in TNF release 
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bioassay. D150M58 CTL is specific for the HLA-A2 binding influenza matrix peptide 
nuM58. 

Figure 4 illustrates the abilit>^ of T cell lines generated from P]S#10 
immunized mice to specifically lyse PlS#10-pulsed Jurkat A2Kb targets and P501S- 
5 transduced Jurkat A2Kb targets, as compared to EGFP-transduced Jurkat A2Kb. The 
percent lysis is shown as a series of effector to target ratios, as indicated. 

Figure 5 illustrates the ability of a T cell clone to recognize and 
specifically lyse Jurkat A2Kb cells expressing the representative prostate-specific 
polypeptide P501S, thereby demonstrating that the P1S#10 peptide may be a naturally 
1 0 processed epitope of the P501 S polypeptide. 

Figures 6A and 6B are graphs illustrating the specificity' of a CDS"" cell 
line (3A-n for a representative prostate-specific antigen (P501S). Figure 6A shows the 
results of a ^'Cr release assay. The percent specific lysis is shown as a series of 
effector:target ratios, as indicated. Figure 6B shows the production of interferon- 
15 gamma by 3A-1 cells stimulated with autologous B-LCL transduced with P501S, at 
varying effectontarget rations as indicated. 

Figure 7 is a Western blot showing the expression of P501S m 

baculovirus. 

Fi gure 8 illustrates the results of epitope mapping studies on P501S. 
20 Figure 9 is a schematic representafion of the P501S protein showang the 

location of transmembrane domains and predicted intracellular and extracellular 
domains. 

Figure 10 is a genomic map showing the location of the prostate genes 
P775P, P704P, B305D, P712P and P774P within the Cat Eye Syndrome region of 
25 chromosome 22q 1 L2 

Figure 11 shows the results of an ELISA assay to determine the 
specificit}^ of rabbit polyclonal antisera raised against P501S. 

SEQ ID NO: 1 is the determined cDNA sequence for Fl-13 
SEQ ID NO; 2 is the determined 3' cDNA sequence for FM2 
30 SEQ ID NO: 3 is the detemiined 5" cDNA sequence for Fl-12 
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4 is the determined 3' cDNA sequence for Fl-1 6 

5 is the determined 3' cDNA sequence for HI -1 

6 is the determined 3' cDNA sequence for HI -9 

7 is the determined 3' cDNA sequence for HI -4 

8 is the determined 3' cDNA sequence for Jl-17 

9 is the determined 5' cDNA sequence for Jl-17 
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s the determined 3' cDNA sequence for LI -12 
s the determined 5' cDNA sequence for LI -12 
s the determined 3' cDNA sequence for Nl-1862 
s the determined 5' cDNA sequence for Nl-1862 
s the determined 3' cDNA sequence for Jl-13 
s the determined 5' cDNA sequence for Jl-13 
s the determined 3' cDNA sequence for J 1-1 9 
s the determined 5' cDNA sequence for J 1-1 9 
s the determined 3' cDNA sequence for J 1-25 
s the determined 5' cDNA sequence for Jl-25 
s the determined 5' cDNA sequence for J 1-24 
s the determined 3' cDNA sequence for J 1-24 
s the determined 5' cDNA sequence for Kl-58 
s the determined 3' cDNA sequence for Kl-58 
s the determined 5' cDNA sequence for Kl-63 
s the determined 3' cDNA sequence for Kl-63 
s the determined 5' cDNA sequence for LI -4 
s the determined 3' cDNA sequence for LI -4 
s the determined 5' cDNA sequence for LI -14 
s the determined 3' cDNA sequence for LI -14 
s the determined 3' cDNA sequence for J 1-1 2 
s the determined 3' cDNA sequence for J1-I6 
s the determined 3' cDNA sequence for J 1-21 
s the determined 3' cDNA sequence for Kl-48 
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SEQ ID NO: 34 is the determined 3' cDNA sequence for Kl-55 
SEQ ID NO: 35 is the determined 3' cDNA sequence for LI -2 
SEQ ID NO: 36 is the determined 3' cDNA sequence for LI -6 
SEQ IL^ NO: 37 is the determined 3' cDNA sequence for Nl-1 858 
SEQ ID NO: 38 is the determined 3' cDNA sequence for Nl-1 860 
SEQ ID NO: 39 is the determined 3' cDNA sequence for Nl-1 861 
SEQ ID NO: 40 is the determined 3' cDNA sequence for Nl- 1864 
SEQ ID NO: 41 is the determined cDNA sequence for P5 
SEQ ID NO: 42 is the determined cDNA sequence for P8 
SEQ ID NO: 43 is the determined cDNA sequence for P9 
SEQ ID NO: 44 is the detennined cDNA sequence for PI 8 
SEQ ID NO: 45 is the determined cDNA sequence for P20 
SEQ ID NO: 46 is the determined cDNA sequence for P29 
SEQ ID NO: 47 is the determined cDNA sequence for P30 
SEQ ID NO: 48 is the detennined cDNA sequence for P34 
SEQ ID NO: 49 is the determined cDNA sequence for P36 
SEQ ID NO: 50 is the determined cDNA sequence for P38 
SEQ ID NO: 5] is the determined cDNA sequence for P39 
SEQ ID NO: 52 is the determined cDNA sequence for P42 
SEQ ID NO: 53 is the determined cDNA sequence for P47 
SEQ ID NO: 54 is the determined cDNA sequence for P49 
SEQ ID NO: 55 is the determined cDNA sequence for P50 
SEQ ID NO: 56 is the determined cDNA sequence for P53 
SEQ ID NO: 57 is the determined cDNA sequence for P55 
SEQ ID NO: 58 is the detennined cDNA sequence for P60 
SEQ ID NO: 59 is the determined cDNA sequence for P64 
SEQ ID NO: 60 is the detennined cDNA sequence for P65 
SEQ ID NO: 61 is the detennined cDNA sequence for P73 
SEQ ID NO: 62 is the determined cDNA sequence for P75 
SEQ ID NO: 63 is the detennined cDNA sequence for P76 
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to as P704P) 



10 



15 



20 



25 



SEQ ID NO: 64 is the determined cDNA sequence for P79 
SEQ ID NO: 65 is the determined cDNA sequence for P84 
SEQ ID NO: 66 is the determined cDNA sequence for P68 
SEQ ID NO: 67 is the determined cDNA sequence for P80 (also referred 



30 



SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 



68 
69 
70 
71 
72 
73 
74 
75 
76 
77 
78 
79 
80 
81 
82 
83 
84 
85 
86 
87 
88 
89 
90 
91 
92 



the determined cDNA sequence for P82 
the deteiTnined cDNA sequence for U 1-3064 
the determined cDNA sequence for Ul-3065 
the determined cDNA sequence for VI -3692 
the determined cDNA sequence for 1 A-3905 
the determined cDNA sequence for VI -3686 
the determined cDNA sequence for Rl -2330 
s the determined cDNA sequence for lB-3976 
s the determined cDNA sequence for VI -3679 
s the determined cDNA sequence for lG-4736 
s the determined cDNA sequence for lG-4738 
s the determined cDNA sequence for lG-4741 
s the determined cDNA sequence for lG-4744 
s the determined cDNA sequence for lG-4734 
s the determined cDNA sequence for lH-4774 
s the determined cDNA sequence for lH-4781 
s the determined cDNA sequence for lH-4785 
s the determined cDNA sequence for lH-4787 
is the determined cDNA sequence for lH-4796 
s the determined cDNA sequence for 11-4807 
s the determined cDNA sequence for 11-4810 
s the determined cDNA sequence for 11-481 1 
s the determined cDNA sequence for lJ-4876 
s the determined cDNA sequence for lK-4884 
s the determined cDNA sequence for lK-4896 
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SEQ ID NO 
SHQIDNO 
SEQ ID NO 
SEQ ID NO: 
^ SEQ ID NO: 

SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
SEQ ID NO: 
10 SEQ ID NO: 

SEQ ID NO: 
SEQ) ID NO: 
SEQ ID NO: 
SEQ ID NO: 
15 S[';QIDN0: 
(also referred lo as P504S) 

SEQ ID NO: 108 is the predicted amino acid sequence for F1-I2 
SEQ ID NO: 109 is the determined full length cDNA sequence for Jl-17 
SEQ ID NO: 1 10 is the determined full length cDNA sequence for LI -12 
20 (also referred to as P501S) 

SEQ ID NO: 1 11 is the determined full length cDNA sequence for Nl- 
1862 (also referred to as P503S) 

SEQ ID NO: 1 12 is the predicted amino acid sequence for Jl-1 7 
SEQ ID NO: 113 is the predicted amino acid sequence for LI-12 (also 
25 referred to as P50 IS) 

SEQ ID NO: 11 4 is the predicted amino acid sequence for N 1-1 862 (also 
referred to as P503S) 

SEQ ID NO: 1 15 is the detemiined cDNA sequence for P89 
SEQ ID NO: 116 is the determined cDNA sequence for P90 
30 SEQ ID NO: 1 1 7 is the determined cDNA sequence for P92 



: 93 is the detennined cDNA sequence for lG-4761 
: 94 is the detennined cDNA sequence for lG-4762 
: 95 is the determined cDNA sequence for lH-4766 
: 96 is the determined cDNA sequence for lH-4770 
: 97 is the detemiined cDNA sequence for lH-477] 
: 98 is the detemiined cDNA sequence for lH-4772 

99 is the determined cDNA sequence for lD-4297 

100 is the determined cDNA sequence for lD-4309 

101 is the determined cDNA sequence for ID. 1-4278 

102 is the determined cDNA sequence for lD-4288 

103 is the determined cDNA sequence for ID-4283 

104 is the determined cDNA sequence for lD-4304 

105 is the determined cDNA sequence for lD-4296 

106 is the determined cDNA sequence for lD-4280 

107 is the determined full length cDNA sequence for Fl-12 
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SEQ ID NO: 


1 18 




SEQ ID NO: 


1 19 




SEQ ID NO: 


120 




SEQ ID NO: 


121 


5 


SEQ ID NO: 


122 




SEQ ID NO: 


123 




SEQ ID NO: 


124 




SEQ ID NO: 


125 




SEQ ID NO: 


126 


10 


SEQ ID NO: 


127 




SEQ ID NO: 


128 




SEQ ID NO: 


129 




SEQ ID NO: 


130 




SEQ ID NO: 


131 


15 


SEQ ID NO: 


132 




SEQ ID NO: 


133 




SEQ ID NO: 


134 




SEQ ID NO: 


135 




SEQ ID NO: 


136 


20 


SEQ ID NO: 


137 




SEQ ID NO; 


138 




SEQ ID NO: 


139 




SEQ ID NO: 


140 




SEQ ID NO: 


141 


25 


SEQ ID NO: 


142 




SEQ ID NO: 


143 




SEQ ID NO: 


144 




SEQ ID NO: 


145 




SEQ ID NO: 


146 


30 


SEQ ID NO: 


147 



s the determined cDNA sequence for P95 
s the detennined cDNA sequence for P98 
is the determined cDNA sequence for PI 02 
iS the detennined cDNA sequence for PI 10 
[s the determined cDNA sequence for PI 1 1 
IS the determined cDNA sequence for PI 14 
is the determined cDNA sequence for Pll 5 
is the determined cDNA sequence for P116 
is the determined cDNA sequence for PI 24 
is the determined cDNA sequence for PI 26 
is the determined cDNA sequence for PI 30 
is the determined cDNA sequence for PI 33 
is the determined cDNA sequence for PI 38 
is the determined cDNA sequence for PI 43 
is the determined cDNA sequence for PI 5 1 
is the determined cDNA sequence for PI 56 
is the determined cDNA sequence for PI 57 
is the determined cDNA sequence for PI 66 
is the determined cDNA sequence for PI 76 
is the determined cDNA sequence for PI 78 
is the determined cDNA sequence for PI 79 
is the determined cDNA sequence for PI 85 
is the detennined cDNA sequence for PI 92 
is the determined cDNA sequence for P201 
is the determined cDNA sequence for P204 
is the determined cDNA sequence for P208 
is the determined cDNA sequence for P21 1 
is the determined cDNA sequence for P213 
is the determined cDNA sequence for P219 
is the determined cDNA sequence for P237 
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SEQ ID NO: 148 
SHQ ID NO: 149 
SEQ ID NO: 150 
SEQ ID NO: 151 
SEQ ID NO: 152 
SEQ ID NO: 153 
SEQ ID NO: 154 
SEQ ID NO: 155 
SEQ ID NO: 156 
SEQ ID NO: 157 
SEQ ID NO: 158 
SEQ ID NO: 159 
SEQ ID NO: 160 
SEQ ID NO: 161 
SEQ ID NO: 162 
SEQ ID NO: 163 
SEQ ID NO: 164 
SEQ ID NO: 165 
SEQ ID NO: 166 
SEQ ID NO: 167 
SEQ ID NO: 168 
SEQ ID NO: 169 
SEQ ID NO: 170 
SEQ ID NO: 171 
SEQ ID NO: 172 
SEQ ID NO: ] 73 
SEQ ID NO: 174 
SEQ ID NO; 175 
SEQ ID NO: ] 76 
SEQ ID NO: 177 



s the determined cDNA sequence for P239 

s the determined cDNA sequence for P248 

s the determined cDNA sequence for P25 1 

s the determined cDNA sequence for P255 

s the determined cDNA sequence for P256 

s the determined cDNA sequence for P259 

s the determined cDNA sequence for P260 

s the determined cDNA sequence for P263 

s the determined cDNA sequence for P264 

s the determined cDNA sequence for P266 

s the determined cDNA sequence for P270 

s the detennined cDNA sequence for P272 

s the determined cDNA sequence for P278 

s the determined cDNA sequence for PI 05 

s the determined cDNA sequence for PI 07 

s the determined cDNA sequence for PI 37 

s the determined cDNA sequence for PI 94 

s the determined cDNA sequence for PI 95 

s the determined cDNA sequence for PI 96 

s the determined cDNA sequence for P220 

s the determined cDNA sequence for P234 

s the determined cDNA sequence for P235 

s the determined cDNA sequence for P243 

s the determined cDNA sequence for P703P-DE1 

s the predicted amino acid sequence for P703P-DEI 

s the determined cDNA sequence for P703P-DE2 

3 the determined cDNA sequence for P703P-DE6 

5 the determined cDNA sequence for P703P-DE13 

; the predicted amino acid sequence for P703P-DEI3 

; the determined cDNA sequence for P703P-DE14 
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SEQ ID NO: 178 is the predicted amino acid sequence for P703P-DE14 
SEQ ID NO: 179 is the determined extended cDNA sequence for IG- 



4736 

5 4738 
4741 
4744 
4774 
4781 
15 4785 
4787 
4796 
4807 



SEQ ID NO: 180 is the determined extended cDNA sequence for IG- 

SEQ ID NO: 181 is the determined extended cDNA sequence for IG- 

SEQ ID NO: 182 is the determined extended cDNA sequence for IG- 

10 SEQ ID NO: 183 is the determined extended cDNA sequence for IH- 

SEQ ID NO: 184 is the determined extended cDNA sequence for IH- 

SEQ ID NO: 185 is the determined extended cDNA sequence for IH- 

SEQ ID NO: 186 is the determined extended cDNA sequence for IH- 

SEQ ID NO: 187 is the determined extended cDNA sequence for IH- 

20 SEQ ID NO: 188 is the determined extended cDNA sequence for II- 

SEQ ID NO: 1 89 is the determined 3' cDNA sequence for 11-481 0 
SEQ ID NO: 190 is the determined 3' cDNA sequence for 11-481 1 
SEQ ID NO: 191 is the determined extended cDNA sequence for IJ- 

25 4876 

SEQ ID NO: 192 is the determined extended cDNA sequence for IK- 

4884 

SEQ ID NO: 193 is the determined extended cDNA sequence for IK- 

4896 
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SEQ ID NO: 194 is the determined extended cDNA sequence for IG- 

SEQ ID NO: 195 is the determined extended cDNA sequence for IG- 

SHQ ID NO: 196 is the determined extended cDNA sequence for IH- 

SEQ ID NO: 197 is the determined 3' cDNA sequence for lH-4770 
SEQ ID NO: 198 is the determined 3' cDNA sequence for lH-477] 
SEQ ID NO: 199 is the determined extended cDNA sequence for 111- 

SEQ ID NO: 200 is the determined extended cDNA sequence for ID- 

SEQ ID NO: 201 is the determined extended cDNA sequence for ID.l- 

SEQ ID NO: 202 is the determined extended cDNA sequence for ID- 

SEQ ID NO: 203 is the determined extended cDNA sequence for ID- 

SEQ ID NO: 204 is the determined extended cDNA sequence for ID- 

SEQ ID NO: 205 is the determined extended cDNA sequence for ID- 

SEQ ID NO: 206 is the determined extended cDNA sequence for ID- 

SEQ ID NO: 207 is the determined cDNA sequence for 10-d8fwd 
SEQ ID NO: 208 is the determined cDNA sequence for 10-HlOcon 
SEQ ID NO: 209 is the determined cDNA sequence for ll-C8rev 
SEQ ID NO: 210 is the determined cDNA sequence for 7.g6f\vd 
SEQ ID NO: 21 1 is the determined cDNA sequence for 7.g6rev 
SEQ ID NO: 212 is the determined cDNA sequence for 8-b5f\vd 
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SEQ ID NO: 213 is the determined cDNA sequence for 8-b5rev 
SEQ ID NO: 214 is the determined cDNA sequence for 8-b6fwd 
SEQ ID NO: 21 5 is the determined cDNA sequence for 8-b6 rev 
SEQ ID NO: 216 is the determined cDNA sequence for 8-d4fwd 
SEQ ID NO: 217 is the determined cDNA sequence for 8-d9rev 
SEQ ID NO: 218 is the determined cDNA sequence for 8-g3fwd 
SEQ ID NO: 219 is the determined cDNA sequence for 8-g3rev 
SEQ ID NO: 220 is the detennined cDNA sequence for 8-hl Irev 
SEQ ID NO: 221 is the determined cDNA sequence for g-fl2fwd 
SEQ ID NO: 222 is the determined cDNA sequence for g-f3rev 
SEQ ID NO: 223 is the determined cDNA sequence for P509S 
SEQ ID NO: 224 is the determined cDNA sequence for P510S 
SEQ ID NO: 225 is the determined cDNA sequence for P703DE5 
SEQ ID NO: 226 is the determined cDNA sequence for 9-Al 1 
SEQ ID NO; 227 is the determined cDNA sequence for 8-C6 
SEQ ID NO: 228 is the determined cDNA sequence for 8-H7 
SEQ ID NO: 229 is the determined cDNA sequence for JPTPN13 
SEQ ID NO: 230 is the determined cDNA sequence for JPTPN14 
SEQ ID NO: 231 is the determined cDNA sequence for JPTPN23 
SEQ ID NO: 232 is the determined cDNA sequence for JPTPN24 
SEQ ID NO: 233 is the determined cDNA sequence for JPTPN25 
SEQ ID NO: 234 is the determined cDNA sequence for JPTPN30 
SEQ ID NO: 235 is the determined cDNA sequence for JPTPN34 
SEQ ID NO: 236 is the determined cDNA sequence for PTPN35 
SEQ ID NO: 237 is the determined cDNA sequence for JPTPN36 
SEQ ID NO: 238 is the determined cDNA sequence for JPTPN38 
SEQ ID NO: 239 is the determined cDNA sequence for JPTPN39 
SEQ ID NO: 240 is the determined cDNA sequence for JPTPN40 
SEQ ID NO: 241 is the determined cDNA sequence for JPTPN41 
SEQ ID NO: 242 is the determined cDNA sequence for JPTPN42 
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SEQ ID NO: 243 is the determined cDNA sequence for JPTPN45 
SEQ ID NO: 244 is the determined cDNA sequence for JPTPN46 
SEQ ID NO: 245 is the detennined cDNA sequence for JPTPN51 
SEQ ID NO: 246 is the detemiined cDNA sequence for JP1PN56 
SEQ ID NO: 247 is the determined cDNA sequence forPTPN64 
SEQ ID NO: 248 is the determined cDNA sequence for JPTPN65 
SEQ ID NO: 249 is the determined cDNA sequence for JPTPN67 
SEQ ID NO: 250 is the determined cDNA sequence for JPTPN76 
SEQ ID NO: 251 is the determined cDNA sequence for JPrPN84 
SEQ ID NO: 252 is the detemiined cDNA sequence for JPTPN85 
SEQ ID NO: 253 is the determined cDNA sequence for JPrPN86 
SEQ ID NO: 254 is the detemiined cDNA sequence for JPTPN87 
SEQ ID NO: 255 is the determined cDNA sequence for JPTPN88 
SEQ ID NO: 256 is the determined cDNA sequence for JPIFI 
SEQ ID NO: 257 is the determined cDNA sequence for JP1F2 
SEQ ID NO: 258 is the determined cDNA sequence for JP1C2 
SEQ ID NO: 259 is the determined cDNA sequence for JPIBI 
SEQ ID NO: 260 is the detennined cDNA sequence for JP1B2 
SEQ ID NO: 261 is the determined cDNA sequence for JP1D3 
SEQ ID NO: 262 is the determined cDNA sequence for JPl A4 
SEQ ID NO: 263 is the determined cDNA sequence for JP1F5 
SEQ ID NO: 264 is the determined cDNA sequence for JP1E6 
SEQ ID NO: 265 is the determined cDNA sequence for JP1D6 
SEQ ID NO: 266 is the determined cDNA sequence for JP1B5 
SEQ ID NO: 267 is the determined cDNA sequence for JPl A6 
SEQ ID NO: 268 is the determined cDNA sequence for JP1E8 
SEQ ID NO: 269 is the determined cDNy\ sequence for JP1D7 
SEQ ID NO: 270 is the determined cDNA sequence for JP1D9 
SEQ ID NO: 271 is the detemiined cDNA sequence for JPl CIO 
SEQ ID NO: 272 is the determined cDNA sequence for JPl A9 
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SEQ ID NO: 273 is the determined cDNA sequence for JP1F12 
SEQ ID NO: 274 is the determined cDNA sequence for JP1E12 
SEQ ID NO: 275 is the determined cDNA sequence for JPIDI 1 
SEQ ID NO: 276 is the determined cDNA sequence for JPICI 1 
SEQ ID NO: 277 is the determined cDNA sequence for JP1C12 
SEQ ID NO: 278 is the determined cDNA sequence for JP1B12 
SEQ ID NO: 279 is the determined cDNA sequence for JPl A12 
SEQ ID NO: 280 is the determined cDNA sequence for JP8G2 
SEQ ID NO: 281 is the determined cDNA sequence for JP8H1 
SEQ ID NO: 282 is the determined cDNA sequence for JP8H2 
SEQ ID NO: 283 is the determined cDNA sequence for JP8A3 
SEQ ID NO: 284 is the determined cDNA sequence for JP8A4 
SEQ ID NO: 285 is the determined cDNA sequence for JP8C3 
SEQ ID NO: 286 is the determined cDNA sequence for JP8G4 
SEQ ID NO: 287 is the determined cDNA sequence for JP8B6 
SEQ ID NO: 288 is the determined cDNA sequence for JP8D6 
SEQ ID NO: 289 is the determined cDNA sequence for JP8F5 
SEQ ID NO: 290 is the determined cDNA sequence for JP8A8 
SEQ ID NO: 291 is the determined cDNA sequence for JP8C7 
SEQ ID NO: 292 is the determined cDNA sequence for JP8D7 
SEQ ID NO: 293 is the determined cDNA sequence for P8D8 
SEQ ID NO: 294 is the determined cDNA sequence for JP8E7 
SEQ ID NO: 295 is the determined cDNA sequence for JP8F8 
SEQ ID NO: 296 is the determined cDNA sequence for JP8G8 
SEQ ID NO: 297 is the determined cDNA sequence for JP8B10 
SEQ ID NO: 298 is the determined cDNA sequence for JP8C10 
SEQ ID NO: 299 is the determined cDNA sequence for JP8E9 
SEQ ID NO: 300 is the determined cDNA sequence for JP8E10 
SEQ ID NO: 301 is the determined cDNA sequence for JP8F9 
SEQ ID NO: 302 is the determined cDNA sequence for JP8H9 
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SEQ ID NO: 303 is the determined cDNA sequence for JP8C12 
SEQ ID NO: 304 is the determined cDNA sequence for JP8E1 1 
SEQ ID NO: 305 is the determined cDNA sequence for JP8E12 
SEQ ID NO: 306 is the amino acid sequence for the peptide PS2#I2 
SEQ ID NO: 307 is the determined cDNA sequence for P711P 
SEQ ID NO: 308 is the determined cDNA sequence for P712P 
SEQ ID NO: 309 is the determined cDNA sequence for CLONE23 
SEQ ID NO: 310 is the detennined cDNA sequence for P774P 
SEQ ID NO: 311 is the determined cDNA sequence for P775P 
0 SEQ ID NO: 312 is the determined cDNA sequence for P71 5P 

SEQ ID NO: 313 is the determined cDNA sequence for P710P 
SEQ ID NO: 314 is the determined cDNA sequence for P767P 
SEQ ID NO: 315 is the determined cDNA sequence for P768P 
SEQ ID NO: 316-325 are the detennined cDNA sequences of previously 

5 isolated genes 

SEQ ID NO: 326 is the determined cDNA sequence for P703PDE5 
SEQ ID NO: 327 is the predicted amino acid sequence for P703PDE5 
SEQ ID NO: 328 is the determined cDNA sequence for P703P6.26 
SEQ ID NO: 329 is the predicted amino acid sequence for P703P6.26 
SEQ ID NO: 330 is the determined cDNA sequence for P703PX-23 
SEQ ID NO: 33 1 is the predicted amino acid sequence for P703PX-23 
SEQ ID NO: 332 is the determined full length cDNA sequence for 

P509S 

SEQ ID NO: 333 is the determined extended cDNA sequence for P707P 
(also refeiTed to as 11 -C9) 

SEQ ID NO: 334 is the determined cDNA sequence for P714P 
SEQ ID NO: 335 is the determined cDNA sequence for P705P (also 
referred to as 9-F3) 

SEQ ID NO: 336 is the predicted amino acid sequence for P705P 
SEQ ID NO: 337 is the amino acid sequence of the peptide P1S#10 
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SEQ ID NO: 338 is the amino acid sequence of the peptide p5 
SEQ ID NO: 339 is the predicted amino acid sequence of P509S 
SEQ ID NO: 340 is the determined cDNA sequence for P778P 
SEQ ID NO: 341 is the determined cDNA sequence for P786P 
5 SEQ ID NO: 342 is the determined cDNA sequence for P789P 

SEQ ID NO: 343 is the determined cDNA sequence for a clone showing 

homology to Homo sapiens MM46 mRNA 

SEQ ID NO: 344 is the determined cDNA sequence for a clone showing 

homology to Homo sapiens TNF-alpha stimulated ABC protein (ABC50) mRNA 
10 SEQ ID NO: 345 is the determined cDNA sequence for a clone showing 

homology to Homo sapiens mRNA for E-cadherin 

SEQ ID NO: 346 is the determined cDNA sequence for a clone showing 

homology to Human nuclear-encoded mitochondrial serine hydroxymethyltransferase 

(SHMT) 

15 SEQ ID NO: 347 is the determined cDNA sequence for a clone showing 

homology to Homo sapiens natural resistance-associated macrophage protein2 
(NRAMP2) 

SEQ ID NO: 348 is the determined cDNA sequence for a clone showing 
homology to Homo sapiens phosphoglucomutase-related protein (PGMRP) 

20 SEQ ID NO: 349 is the determined cDNA sequence for a clone showing 

homology to Human mRNA for proteosome subunit p40 

SEQ ID NO: 350 is the determined cDNA sequence for P777P 
SEQ ID NO: 351 is the determined cDNA sequence for P779P 
SEQ ID NO: 352 is the determined cDNA sequence for P790P 

25 SEQ ID NO: 353 is the determined cDNA sequence for P784P 

SEQ ID NO: 354 is the determined cDNA sequence for P776P 
SEQ ID NO: 355 is the determined cDNA sequence for P780P 
SEQ ID NO: 356 is the determined cDNA sequence for P544S 
SEQ ID NO: 357 is the determined cDNA sequence for P745S 

30 SEQ ID NO: 358 is the determined cDNA sequence for P782P 
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SEQ ID NO: 359 is the detennined cDNA sequence for P783P 
SEQ ID NO: 360 is the determined cDNA sequence for unknown 17984 
SEQ ID NO: 361 is the detemiined cDNA sequence for P787P 
SEQ ID NO: 362 is the determined cDNA sequence for P788P 
5 SEQ ID NO: 363 is the detemiined cDNA sequence for unknown 1 7994 

SEQ ID NO: 364 is the determined cDNA sequence for P781P 
SEQ ID NO: 365 is the determined cDNA sequence for P785P 
SEQ ID NO: 366-375 are the determined cDNA sequences for splice 
variants of B305D. 

'0 SEQ ID NO: 376 is the predicted amino acid sequence encoded by the 

sequence of SEQ ID NO: 366. 

SEQ ID NO: 377 is the predicted amino acid sequence encoded by the 
sequence of SEQ ID NO: 372. 

SEQ ID NO: 378 is the predicted amino acid sequence encoded by the 
1 5 sequence of SEQ ID NO: 373. 

SEQ ID NO: 379 is the predicted amino acid sequence encoded by the 
sequence of SEQ ID NO: 374. 

SEQ ID NO: 380 is the predicted amino acid sequence encoded by the 
sequence of SEQ ID NO: 375. 
20 SEQ ID NO: 381 is the determined cDNA sequence for 871 6P. 

SEQ ID NO: 382 is the determined full-length cDNA sequence for 

P711P. 

SEQ ID NO: 383 is the predicted amino acid sequence for P71 IP. 

SEQ ID NO: 384 is the cDNA sequence for PIOOOC. 
25 SEQ ID NO: 385 is the cDNA sequence for CGI-82. 

SEQ ID NO:386 is the cDNA sequence for 23320. 

SEQ ID NO:387 is the cDNA sequence for CGI-69. 

SEQ ID NO:388 is the cDNA sequence for L-iditol-2-dehydrogenase. 

SEQ ID NO:389 is the cDNA sequence for 23379. 
"^0 SEQ ID NO:390 is the cDNA sequence for 2338 1 , 
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SEQ ID NO:391 is the cDNA sequence for KIAA0122. 

SEQ ID NO:392 is the cDNA sequence for 23399. 

SEQ ID NO:393 is the cDNA sequence for a previously identified gene. 

SEQ ID NO:394 is the cDNA sequence for HCLBP. 

SEQ ID NO:395 is the cDNA sequence for transglutaminase. 

SEQ ID NO: 396 is the cDNA sequence for a previously identified gene. 

SEQ ID NO:397 is the cDNA sequence for PAP. 

SEQ ID NO:398 is the cDNA sequence for Ets transcription factor 

SEQ ID NO:399 is the cDNA sequence for hTGR. 

SEQ ID NO:400 is the cDNA sequence for KIAA0295. 

SEQ IDNO:401 is the cDNA sequence for 22545. 

SEQ ID NO:402 is the cDNA sequence for 22547. 

SEQ ID NO:403 is the cDNA sequence for 22548. 

SEQ ID NO:404 is the cDNA sequence for 22550. 

SEQ ID NO:405 is the cDNA sequence for 22551 . 

SEQ ID NO:406 is the cDNA sequence for 22552. 

SEQ ID NO:407 is the cDNA sequence for 22553 (also knoAvn as 



SEQ 


IDNO:408 


is the 


cDNA 


sequence 


for 22558. 


SEQ 


IDNO:409 


is the 


cDNA 


sequence 


for 22562. 


SEQ 


ID NO:410 


is the 


cDNA 


sequence 


for 22565. 


SEQ 


IDNO:411 


is the 


cDNA 


sequence 


for 22567. 


SEQ 


IDNO:412 


is the 


cDNA 


sequence 


for 22568. 


SEQ 


ID NO:413 


is the 


cDNA 


sequence 


for 22570. 


SEQ 


IDNO:414 


is the 


cDNA 


sequence 


for 22571. 


SEQ 


ID NO:415 


is the 


cDNA 


sequence 


for 22572. 


SEQ 


IDNO:416 


is the 


cDNA 


sequence 


for 22573. 


SEQ 


ID NO:417 


is the 


cDNA 


sequence 


for 22573. 


SEQ 


IDNO:418 


is the 


cDNA 


sequence 


for 22575. 
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SEQIDNO:4I9 
ShQ ID NO:420 
SEQIDNO:42] 
SEQIDNO:422 
SEQIDNO:423 
SEQ IDNO:424 
SEQ ID NO:425 
SHQ IDNO:426 
SHQ JDNO:427 
SEQ IDNO:428 
SEQlDNO:429 
SEQ IDNO:430 
SEQ lDNO:43] 
SEQ lDNO:432 
SEQ IDNO:433 
SEQlDNO:434 
SEQIDNO:435 
SEQlDNO:436 
SEQ IDNO:437 
SEQIDNO:438 
SEQ IDNO:439 
SEQ ID NO:440 
SEQ IDNO:441 
SEQ IDNO:442 
SEQ IDNO:443 
SEQ ID NO:444 
SEQIDNO:445 
SEQ IDNO:446 
SEQ ID NO:447 
SEQ IDNO:448 



s the cDNA sequence for 22580. 
s the cDNA sequence for 22581. 
s the cDNA sequence for 22582. 
s the cDNA sequence for 22583. 
s the cDNA sequence for 22584. 
s the cDNA sequence for 22585. 
s the cDNA sequence for 22586. 
s the cDNA sequence for 22587. 
s the cDNA sequence for 22588. 
s the cDNA sequence for 22589. 
s the cDNA sequence for 22590. 
s the cDNA sequence for 22591. 
s the cDNA sequence for 22592. 
s the cDNA sequence for 22593. 
s the cDNA sequence for 22594. 
s the cDNA sequence for 22595. 
s the cDNA sequence for 22596. 
s the cDNA sequence for 22847. 
s the cDNA sequence for 22848. 
s the cDNA sequence for 22849. 
s the cDNA sequence for 2285 E 
s the cDNA sequence for 22852. 
s the cDNA sequence for 22853. 
s the cDNA sequence for 22854. 
s the cDNA sequence for 22855. 
s the cDNA sequence for 22856. 
s the cDNA sequence for 22857. 
s the cDNA sequence for 23601 . 
s the cDNA sequence for 23602. 
s the cDNA sequence for 23605. 
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SEQ IDNO:449 
SEQ ID NO:450 
SEQ IDNO:45] 
SEQ ID NO;452 
SEQ ID NO:453 
SEQ IDNO:454 
SEQ ID NO:455 
SEQ ID NO:456 
SEQ ID NO:457 
SEQ ID NO:458 
SEQ ID NO:459 
SEQ ID NO:460 
SEQ IDNO:461 



s the cDNA sequence for 23606. 
s the cDNA sequence for 23612. 
s the cDNA sequence for 23614, 
s the cDNA sequence for 2361 8. 
s the cDNA sequence for 23622. 
s the cDNA sequence for folate hydrolase, 
s the cDNA sequence for LIM protein, 
s the cDNA sequence for a known gene, 
s the cDNA sequence for a known gene. 
1 0 SEO ID NO:458 is the cDNA sequence for a previously identified gene. 

s the cDNA sequence for 23045. 
s the cDNA sequence for 23032. 
s the cDNA sequence for clone 23054. 
SEQ ID NO:462-467 are cDNA sequences for known genes. 
1 5 SEQ ID NO:468-471 are cDNA sequences for P7I0P. 

SEQ ID NO:472 is a cDNA sequence for PIOOIC. 

SEQ ID NO: 473 is the determined cDNA sequence for a first splice 
variant of P775P (referred to as 27505). 

SEQ ID NO: 474 is the determined cDNA sequence for a second splice 
20 variant of P775P (referred to as 1 9947). 

SEQ ID NO: 475 is the determined cDNA sequence for a third splice 
variant of P775P (referred to as 19941). 

SEQ ID NO: 476 is the determined cDNA sequence for a fourth splice 
variant of P775P (referred to as 19937). 
25 SEQ ID NO: 477 is a first predicted amino acid sequence encoded by the 

sequence of SEQ ID NO: 474. 

SEQ ID NO: 478 is a second predicted amino acid sequence encoded by 
the sequence of SEQ ID NO: 474. 

SEQ ID NO: 479 is the predicted amino acid sequence encoded by the 
30 sequence of SEQ ID NO: 475. 
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SEQ ID NO: 480 is a first predicted amino acid sequence encoded by the 
sequence of SEQ ID NO: 473. 

SEQ ID NO; 481 is a second predicted amino acid sequence encoded by 
the sequence of SEQ ID NO: 473. 

^EQ ID NO: 482 is a third predicted amino acid sequence encoded by 
the sequence of SEQ ID NO: 473. 

SEQ ID NO: 483 is a fourth predicted amino acid sequence encoded by 
the sequence of SEQ ID NO: 473. 

SEQ ID NO: 484 is the first 30 amino acids of the M tuberculosis 

0 antigen Ra 12. 

SEQ ID NO: 485 is the PGR primer AW025. 
SEQ ID NO: 486 is the PGR primer AW003. 
SEQ ID NO: 487 is the PGR primer AW027. 
SEQ ID NO: 488 is the PGR primer AW026. 

SEQ ID NO: 489-501 are peptides employed in epitope mapping studies. 
SEQ ID NO: 502 is the determined cDNA sequence of the 
complementarity determining region for the anti-P503S monoclonal antibody 20D4. 

SEQ ID NO: 503 is the determined cDNA sequence of the 
complementarity determining region for the anti-P503S monoclonal antibody JAl. 

SEQ ID NO: 504 & 505 are peptides employed in epitope mapping 

studies. 

SEQ ID NO: 506 is the determined cDNA sequence of the 
complementarity determining region for the anti-P703P monoclonal antibody 8H2. 

SEQ ID NO: 507 is the determined cDNA sequence of the 
complementarit>^ detemiining region for the anti-P703P monoclonal antibody 7H8. 

SEQ ID NO: 508 is the determined cDNA sequence of the 
complementarity' determining region for the anti-P703P monoclonal antibody 2D4. 

SEQ ID NO: 509-522 arc peptides employed in epitope mapping studies. 
SEQ ID NO: 523 is a mature fonn of P703P used to raise antibodies 

against P703P. 
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SEQ ID NO: 524 is the putative full-length cDNA sequence of P703P. 
SEQ ID NO: 525 is the predicted amino acid sequence encoded by SEQ 

SEQ ID NO: 526 is the ftill-length cDNA sequence for P790P. 
SEQ ID NO; 527 is the predicted amino acid sequence for P790P. 
SEQ ID NO: 528 & 529 are PGR primers. 

SEQ ID NO: 530 is the cDNA sequence of a splice variant of SEQ ID 



NO: 366. 



SEQ ID NO: 531 is the cDNA sequence of the open reading frame of 
10 SEQ ID NO: 530. 

SEQ) ID NO: 532 is the predicted amino acid encoded by the sequence of 
SEQ ID NO: 531. 

SEQ ID NO: 533 is the DNA sequence of a putative ORF of P775P, 
SEQ ID NO: 534 is the predicted amino acid sequence encoded by SEQ 

15 ID NO: 533. 

SEQ ID NO: 535 is a first full-length cDNA sequence for P510S. 
SEQ ID NO: 536 is a second full-length cDNA sequence for P510S. 
SEQ ID NO: 537 is the predicted amino acid sequence encoded by SEQ 

ID NO: 535. 

20 SEQ ID NO: 538 is the predicted amino acid sequence encoded by SEQ 

ID NO: 536. 

SEQ ID NO: 539 is the peptide P501S-370. 

SEQ ID NO: 540 is the peptide P501 S-376. 

SEQ ID NO: 541-551 are epitopes of P501S. 
25 SEQ ID NO: 552 is an extended cDNA sequence for P712P. 

SEQ ID NO: 553-568 are the amino acid sequences encoded by 
predicted open reading frames within SEQ ID NO: 552. 

SEQ ID NO: 569 is an extended cDNA sequence for P776P. 

SEQ ID NO: 570 is the determined cDNA sequence for a splice variant 
30 of P776P referred to as contig 6. 
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SEQ ID NO: 571 is the detemiined cDNA sequence for a splice variant 
of P776P referred to as contig 7. 

SEQ ID NO: 572 is the detennined cDNA sequence for a splice variant 
of P776P referred to as contig 14. 
5 SEQ ID NO: 573 is the amino acid sequence encoded by a first predicted 

ORE' of SEQ ID NO: 570. 

SEQ ID NO: 574 is the amino acid sequence encoded by a second 
predicted ORE of SEQ ID NO: 570. 

SEQ ID NO: 575 is the amino acid sequence encoded by a predicted 
10 ORE of SEQ ID NO: 571. 

SEQ ID NO: 576-586 are amino acid sequences encoded by predicted 
ORFs of SEQ ID NO: 569. 

SEQ ID NO: 587 is a DNA consensus sequence of the sequences of 
P767P and P777P. 

^5 SEQ ID NO: 588-590 are amino acid sequences encoded by predicted 

ORFs ofSEQIDNO: 587. 

SEQ ID NO: 591 is an extended cDNA sequence for P1020C. 

SEQ ID NO: 592 is the predicted amino acid sequence encoded by the 
sequence of SEQ ID NO: P1020C. 
^0 SEQ ID NO: 593 is a splice variant of P775P referred to as 50748. 

SEQ ID NO: 594 is a splice variant of P775P referred to as 50717. 

SEQ ID NO: 595 is a splice variant of P775P referred to as 45985. 

SEQ ID NO: 596 is a splice variant of P775P referred to as 38769. 

SEQ ID NO: 597 is a splice variant of P775P referred to as 37922. 

SEQ ID NO: 598 is a splice variant of P510S referred to as 49274. 

SEQ ID NO: 599 is a splice variant of P510S referred to as 39487. 

SEQ ID NO: 600 is a splice variant of P504S referred to as 5167.16. 

SEQ ID NO: 601 is a splice variant of P504S referred to as 5167.1. 

SEQ ID NO: 602 is a splice variant of P504S referred to as 5163.46. 
'0 SEQ ID NO: 603 is a splice variant of P504S referred to as 5163.42. 
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SEQ ID NO: 604 is a splice variant of P504S referred to as 5163.34. 
SEQ ID NO: 605 is a splice variant of P504S referred to as 5163.17. 
SEQ ID NO: 606 is a splice variant of P501S referred to as 10640. 
SEQ ID NO: 607-615 are the sequences of PGR primers. 
5 SEQ ID NO: 616 is the determined cDNA sequence of a fusion of P703P 

and PS A- 

SEQ ID NO: 617 is the amino acid sequence of the fusion of P703P and 

PSA. 

SEQ ID NO: 618 is the cDNA sequence of the gene DD3. 
10 SEQ ID NO: 619 is an extended cDNA sequence for P71 4P. 

SEQ ID NO: 620-622 are the cDNA sequences for splice variants of 

P704P. 

SEQ ID NO: 623 is the cDNA sequence of a splice variant of P553S 
referred to as P553S-14. 
15 SEQ ID NO: 624 is the cDNA sequence of a splice variant of P553S 

referred to as P553S-12, 

SEQ ID NO: 625 is the cDNA sequence of a splice variant of P553S 
referred to as P553S-10. 

SEQ ID NO: 626 is the cDNA sequence of a splice variant of P553S 
20 referred to as P553S-6. 

SEQ ID NO: 627 is the amino acid sequence encoded by SEQ ID NO: 

626. 

SEQ ID NO: 628 is a first amino acid sequence encoded by SEQ ID NO: 

623. 

25 SEQ ID NO: 629 is a second amino acid sequence encoded by SEQ ID 

NO: 623. 

SEQ ID NO: 630 is a first full-length cDNA sequence for prostate- 
specific transglutaminase gene (also referred to herein as P558S). 

SEQ ID NO: 631 is a second full-length cDNA sequence for prostate- 
30 specific transglutaminase gene. 
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SEQ ID NO: 632 is the amino acid sequence encoded by the sequence of 
S1:QIDNO:630. 

SEQ ID NO: 633 is the amino acid sequence encoded by the sequence of 
SEQ ID NO: 631. 

^ SEQ ID NO: 634 is the full-length cDNA sequence for P788P. 

SEQ ID NO: 635 is the amino acid sequence encoded by SEQ ID NO: 

634. 

SEQ ID NO: 636 is the determined cDNA sequence for a polymorphic 
variant of P788P. 

SEQ ID NO: 637 is the amino acid sequence encoded by SEQ ID NO: 

636. 

SEQ ID NO: 638 is the amino acid sequence of peptide 4 from P703P. 
SEQ ID NO: 639 is the cDNA sequence that encodes peptide 4 from 

P703P. 

^ ^ SEQ ID NO: 640-655 are cDNA sequences encoding epitopes of P703P. 

SEQ ID NO: 656-671 are the amino acid sequences of epitopes of 

P703P. 

SEQ ID NO: 672 and 673 are PGR primers. 

SEQ ID NO: 674 is the cDNA sequence encoding an N-terminal portion 
lii of P788P expressed in E, coli, 

SEQ ID NO: 675 is the amino acid sequence of the N-terminal portion of 
P788P expressed in £. coli. 

SEQ ID NO: 676 is the amino acid sequence of the Ad. tuberculosis 

antigen Ral2. 

25 SEQ ID NO: 677 and 678 are PGR primers. 

SEQ ID NO: 679 is the cDNA sequence for the Ral2-P510S-C 

constmct. 

SEQ ID NO: 680 is the cDNA sequence for the P510S-C construct. 
SEQ ID NO: 681 is the cDNA sequence for the P51 0S-E3 construct. 
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SEQ ID NO: 682 is the amino acid sequence for the Ral2-P510S-C 

construct. 

SEQ ID NO: 683 is the amino acid sequence for the P51 OS-C construct. 
SEQ ID NO: 684 is the amino acid sequence for the P51 0S-E3 construct. 
5 SEQ ID NO: 685-690 are PGR primers. 

SEQ ID NO: 691 is the cDNA sequence of the construct Ral2-P775P- 

0RF3. 

SEQ ID NO: 692 is the amino acid sequence of the construct Ral2- 

P775P-ORF3. 

1 0 SEQ ID NO: 693 and 694 are PGR primers. 

SEQ ID NO: 695 is the determined amino acid sequence for a P703P His 
tag fusion protein. 

SEQ ID NO: 696 is the determined cDNA sequence for a P703P His tag 

fusion protein. 

1 5 SEQ ID NO: 697 and 698 are PGR primers. 

SEQ ID NO: 699 is the determined amino acid sequence for a P705P His 
tag fusion protein. 

SEQ ID NO: 700 is the determined cDNA sequence for a P705P His tag 

fusion protein. 

20 SEQ ID NO: 701 and 702 are PGR primers. 

SEQ ID NO: 703 is the determined amino acid sequence for a P71 IP His 
tag fusion protein. 

SEQ ID NO: 704 is the determined cDNA sequence for a P71 IP His tag 

fusion protein. 

25 SEQ ID NO: 705 is the amino acid sequence of the M tuberculosis 

antigen Ral2. 

SEQ ID NO: 706 and 707 are PGR primers. 

SEQ ID NO: 708 is the determined cDNA sequence for the construct 
Ral2-P501S-E2. 
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SEQ ID NO: 709 is the determined amino acid sequence for the 
construct Ral2-P501S-E2. 

SEQ ID NO: 710 is the amino acid sequence for an epitope of P501S. 
SEQ ID NO: 71 1 is the DNA sequence encoding SEQ ID NO: 710. 
:^ SEQ ID NO: 712 is the amino acid sequence for an epitope of P501S. 

SEQ ID NO: 713 is the DNA sequence encoding SEQ ID NO: 712. 
SEQ ID NO: 714 is a peptide employed in epitope mapping studies. 
SEQ ID NO: 715 is the amino acid sequence for an epitope of P501S. 
SEQ ID NO: 716 is the DNA sequence encoding SEQ ID NO: 715. 
SEQ ID NO: 717-719 are the amino acid sequences for CD4 epitopes of 

P501S. 

SEQ ID NO: 720-722 are the DNA sequences encoding the sequences of 
SEQ ID NO: 717-719. 

SEQ ID NO: 723-734 are the amino acid sequences for putative CTL 
15 epitopes of P703P. 

SEQ ID NO: 735 is the full-length cDNA sequence for P789P. 

SEQ ID NO: 736 is the amino acid sequence encoded by SEQ ID NO: 

735. 

SEQ ID NO: 737 is the determined full-length cDNA sequence for the 
20 splice variant of P776P referred to as contig 6. 

SEQ ID NO: 738-739 are determined full-length cDNA sequences for 
the splice variant of P776P referred to as contig 7. 

SEQ ID NO: 740-744 are amino acid sequences encoded by SEQ ID NO: 

737. 

SEQ ID NO: 745-750 are amino acid sequences encoded by the splice 
variant of P776P referred to as contig 7. 

SEQ ID NO: 751 is the full-length cDNA sequence for human 
transmembrane protease serine 2. 

SEQ ID NO: 752 is the amino acid sequence encoded by SEQ ID NO: 

30 751. 



wo 01/51633 



PCT/lJSOl/01574 



34 

SEQ ID NO: 753 is the cDNA sequence encoding the first 209 amino 
acids of human transmembrane protease serine 2. 

SEQ ID NO: 754 is the first 209 amino acids of human transmembrane 
protease serine 2. 

5 SEQ ID NO: 755 is the amino acid sequence of peptide 296-322 of 

P501S. 

SEQ ID NO: 756-759 are PGR primers. 

SEQ ID NO: 760 is the determined cDNA sequence of the Vb chain of a 
T cell receptor for the P501 S-specific T cell clone 4E5. 
10 SEQ ID NO; 761 is the determined cDNA sequence of the Va chain of a 

T cell receptor for the P501 S-specific T cell clone 4E5. 

SEQ ID NO: 762 is the amino acid sequence encoded by SEQ ID NO 

760. 

SEQ ID NO: 763 is the amino acid sequence encoded by SEQ ID NO 

15 761. 

SEQ ID NO: 764 is the full-length open reading frame for P768P 
including stop codon. 

SEQ ID NO: 765 is the full-length open reading frame for P768P without 

stop codon. 

20 SEQ ID NO: 766 is the amino acid sequence encoded by SEQ ID NO; 

765. 

SEQ ID NO: 767-772 are the amino acid sequences for predicted 
domains of P768P. 

SEQ ID NO; 773 is the flill-length cDNA sequence of P835P. 
25 SEQ ID NO; 774 is the cDNA sequence of the previously identified 

clone FLJ13581. 

SEQ ID NO: 775 is the cDNA sequence of the open reading frame for 
P835P with stop codon. 

SEQ ID NO; 776 is the cDNA sequence of the open reading frame for 
30 P835P without stop codon. 
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SEQ ID NO: 777 is the fuli-length amino acid sequence for P835P. 
SEQ ID NO: 778-785 are the amino acid sequences of extracellular and 
intracellular domains of P835P. 

SEQ ID NO: 786 is the full-length cDNA sequence for PIOOOC. 
5 SEQ ID NO; 787 is the cDNA sequence of the open reading frame for 

PIOOOC, including stop codon, 

SEQ ID NO: 788 is the cDNA sequence of the open reading frame for 
PIOOOC. without stop codon. 

SEQ ID NO: 789 is the full-length amino acid sequence for PIOOOC. 

SEQ ID NO: 790 is amino acids 1-100 of SEQ ID NO: 789. 

SEQ ID NO: 791 is amino acids 100-492 of SEQ ID NO: 789. 

SEQ ID NO: 792 is the amino acid sequence of an a prepro-P501S 
recombinant protein. 

1 5 DETAILED DESCRIPTION OF THE INVENTION 

The present invention is directed generally to compositions and their use 
m the therapy and diagnosis of cancer, particularly prostate cancer. As described further 
below, illustrative compositions of the present invention include, but are not restricted 
to, polypeptides, particularly immunogenic polypeptides, polynucleotides encoding such 

20 polypeptides, antibodies and other binding agents, antigen presenting cells (APCs) and 
immune system cells {e.g., T cells). 

The practice of the present invention will employ, unless indicated 
specifically to the contrary, conventional methods of virology, immunology, 
microbiology, molecular biology and recombinant DNA tecliniques within the skill of 
25 the art, many of wiiich are described below for the purpose of illustration. Such 
techniques are explained flilly in the literature. See, e.g^, Sambrook, et al. Molecular 
Cloning: A Laboratory' Manual (2nd Edition, 1989); Maniatis et al. Molecular Cloning: 
A Laboratory^ Manual (1982); DNA Cloning: A Practical Approach, vol. I & II (D. 
Glover, ed.); Oligonucleotide Synthesis (N. Gait, ed., 1984); Nucleic Acid 



wo 01/51633 



PCT/IISO 1/01 574 



36 

Hybridization (B. Hames & S, Higgins, eds., 1985); Transcription and Translation (B. 
Hames & S. Higgins, eds., 1984); Animal Cell Culture (R. Freshney, ed., 1986); Perbal, 
A Practical Guide to Molecular Cloning (1984). 

All publications, patents and patent applications cited herein, whether 
5 supra or infra, are hereby incorporated by reference in their entirety. 

As used in this specification and the appended claims, the singular forms 
"a," "an" and "the" include plural references unless the content clearly dictates 
otherwise. 



Polypeptide Compositions 

10 As used herein, the term "polypeptide" " is used in its conventional 

meaning, i.e., as a sequence of amino acids. The polypeptides are not limited to a 
specific length of the product; thus, peptides, oligopeptides, and proteins are included 
within the definition of polypeptide, and such terms may be used interchangeably herein 
unless specifically indicated otherwise. This term also does not refer to or exclude post- 

1 5 expression modifications of the polypeptide, for example, glycosylations, acetylations, 
phosphorylations and the like, as well as other modifications known in the art, both 
naturally occurring and non-naturally occurring. A polypeptide may be an entire 
protein, or a subsequence thereof Particular polypeptides of interest in the context of 
this invention are amino acid subsequences comprising epitopes, /.e., antigenic 

20 determinants substantially responsible for the inmiunogenic properties of a polypeptide 
and being capable of evoking an immune response. 

Particularly illustrative polypeptides of the present invention comprise 
those encoded by a polynucleotide sequence set forth in any one of SEQ ID NOs: 1-111, 
115-171, 173-175, 177, 179-305, 307-315, 326, 328, 330, 332-335, 340-375, 381, 382 

25 and 384-476, 524, 526, 530, 531, 533, 535, 536, 552, 569-572, 587, 591, 593-606, 618- 
626, 630, 631, 634, 636, 639-655, 674, 680, 681, 711, 713, 716, 720-722, 735, 737-739, 
751, 753, 764, 765, 773-776 and 786-788, or a sequence that hybridizes under 
moderately stringent conditions, or, alternatively, under highly stringent conditions, to a 
polynucleotide sequence set forth in any one of SEQ ID NOs: 1-111, 115-171, 173-175, 
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177. 179-305, 307-315. 326, 328, 330. 332-335, 340-375. 381, 382 and 384-476, 524. 
526, 530, 53 K 533. 535, 536, 552, 569o72, 587, 591, 593-606. 618-626, 630. 631, 634. 
636, 639-655, 674, 680, 681. 711, 713. 716, 720-722. 735, 737-739, 751, 753, 764, 765, 
773-776 and 786-788. In specific embodiments, the polypeptides of the invention 
5 comprise amino acid sequences as set forth in any one of SEQ ID NO: 112-1 14, 172, 
176. 178, 327, 329, 331, 336, 339, 376-380, 383. 477-483, 496, 504, 505, 519. 520, 
522, 525, 527, 532, 534, 537-551, 553-568. 573-586, 588-590, 592. 627-629, 632, 633, 
635, 637, 638, 656-671, 675, 683, 684, 710, 712, 714, 715, 717-719, 723-734. 736, 740- 
750, 752, 754, 755, 766-772, 777-785 and 789-791. 

The polypeptides of the present invention are sometimes herein referred 
to as prostate-specific proteins or prostate-specific polypeptides, as an indication that 
their identification has been based at least in pait upon their increased levels of 
expression in prostate tissue samples. Thus, a "prostate-specific polypeptide" or 
^^prostate-specific protein," refers generally to a polypeptide sequence of the present 
5 invention, or a polynucleotide sequence encoding such a polypeptide, that is expressed 
in a substantial proportion of prostate tissue samples, for example preferably greater 
than about 20%, more preferably greater than about 30%, and most preferably greater 
than about 50%> or more of prostate tissue samples tested, at a level that is at least two 
fold, and preferably at least five fold, greater than the level of expression in other 
0 normal tissues, as determined using a representative assay provided herein. A prostate- 
specific polypeptide sequence of the invention, based upon its increased level of 
expression in tumor cells, has particular utility both as a diagnostic marker as well as a 
therapeutic target, as further described below. 

In certain preferred embodiments, the polypeptides of the invention are 
5 immunogenic, i,e,, they react detectably within an immunoassay (such as an ELISA or 
I-cell stimulation assay) with antisera and/or T-cells from a patient with prostate 
cancer. Screening for immunogenic activity can be performed using techniques well 
known to the skilled artisan. For example, such screens can be performed using 
methods such as those described in Harlow and Lane. Antibodies: A Laboratory 
} Manual, Cold Spring Harbor Laboratory, 1988. In one illustrative example, a 
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polypeptide may be imiiiobilized on a solid support and contacted with patient sera to 
allow binding of antibodies within the sera to the immobilized polypeptide. Unbound 
sera may then be removed and bound antibodies detected using, for example, '^^I- 
labeled Protein A. 

5 As would be recognized by the skilled artisan, iminunogenic portions of 

the polypeptides disclosed herein are also encompassed by the present invention. An 
"immunogenic portion," as used herein, is a fragment of an immunogenic polypeptide 
of the invention that itself is immunologically reactive (i.e., specifically binds) with the 
B-cells and/or T-cell surface antigen receptors that recognize the polypeptide. 

1 0 Immunogenic portions may generally be identified using well known techniques, such 
as those summarized in Paul, Fundamental hnmiinology^, 3rd ed., 243-247 (Raven Press, 
1 993) and references cited therein. Such techniques include screening polypeptides for 
the ability to react with antigen-specific antibodies, antisera and/or T-cell lines or 
clones. As used herein, antisera and antibodies are "antigen-specific" if they 

15 specifically bind to an antigen (/.e., they react with the protein in an ELISA or other 
immunoassay, and do not react detectably with unrelated proteins). Such antisera and 
antibodies may be prepared as described herein, and using well-known techniques. 

In one preferred embodiment, an immunogenic portion of a polypeptide 
of the present invention is a portion that reacts with antisera and/or T-cells at a level that 

20 is not substantially less than the reactivity of the full-length polypeptide {e.g., in an 
ELISA and/or T-cell reactivity assay). Preferably, the level of immunogenic activity of 
the immunogenic portion is at least about 50%, preferably at least about 70% and most 
preferably greater than about 90% of the immunogenicity for the full-length 
polypeptide. In some instances, preferred immunogenic portions will be identified that 

25 have a level of immunogenic activity greater than that of the corresponding full-length 
polypeptide, e.g., having greater than about 100% or 150% or more immunogenic 
activity. 

In certain other embodiments, illustrative immunogenic portions may 
include peptides in which an N-terminal leader sequence and/or transmembrane domain 
30 has been deleted. Other illustrative immunogenic portions will contain a small N- 
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and or C-tennina] deletion (e.g., 1-30 amino acids, preferably 5-15 amino acids), 
relaiix e to the mature protein. 

In another embodiment, a polypeptide composition of the im ention may 
also comprise one or more polypeptides that are immunologically reactive with T cells 
5 and or antibodies generated against a polypeptide of the invention, particularly a 
polypeptide having an amino acid sequence disclosed herein, or to an immunogenic 
fragment or variant thereof 

In another embodiment of the invention, polypeptides are provided that 
comprise one or more polypeptides that are capable of eliciting T cells and/or antibodies 
0 thai arc unmunoh>gically reactive with one or more polypeptides described herein, or 
one or more pel \ p^-pt ides encoded by contiguous nucleic acid sequences contained in 
the polynucleotide sequences disclosed herein, or iminunogenic fragments or variants 
thereof or to one or more nucleic acid sequences which hybridize to one or more of 
these sequences under conditions of moderate to high stringency. 
5 The present invention, in another aspect, provides polypeptide fragments 

comprising at least about 5, 10, 15, 20, 25, 50, or 100 contiguous amino acids, or more, 
including all intermediate lengths, of a polypeptide composition set forth herein, such as 
ihose set forth in SEQ ID NO: 112-114, 172, 176, 178, 327, 329, 331, 336, 339, 376- 
380, 383, 477-483, 496, 504, 505, 519, 520, 522, 525, 527, 532, 534, 537-551 , 553-568, 
0 573-586, 588-590, 592, 627-629, 632, 633, 635, 637, 638, 656-671, 675, 683, 684, 710, 
712, 714, 715, 717-719, 723-734, 736, 740-750, 752, 754, 755, 766-772, 777-785 and 
789-791, or those encoded by a polynucleotide sequence set forth in a sequence of SEQ 
ID NO: Mil, 115-17K 173-175, 177, 179-305, 307-315, 326, 328, 330,332-335, 340- 
375, 381, 382 and 384-476, 524, 526, 530, 531, 533, 535, 536, 552, 569-572, 587, 591, 
593-606, 61 8-626, 630. 631, 634, 636, 639-655, 674, 680, 681, 71 1, 713, 716, 720-722, 
735, 737-739, 75 1 , 753, 764, 765, 773-776 and 786-788. 

In another aspect, the present invention provides variants of the 
polypeptide compositions described herein. Polypeptide variants generally 
encompassed by the present invention will typically exhibit at least about 70%, 75%, 
80%, 85%, 90%, 91%. 92%, 93%. 94%, 95%, 96%, 97%, 98%, or 99% or more identit>^ 
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(determined as described below), along its length, to a polypeptide sequence set forth 
herein. 

In one preferred embodiment, the pol)T3eptide fragments and variants 

provided by the present invention are immunologically reactive with an antibody and/or 
5 T-cell that reacts with a full-length polypeptide specifically set forth herein. 

In another preferred embodiment, the polypeptide fragments and variants 
provided by the present invention exhibit a level of innmunogenic activity of at least 
about 50%, preferably at least about 70%, and most preferably at least about 90%) or 
more of that exhibited by a full-length polypeptide sequence specifically set forth 
10 herein. 

A polypeptide "variant," as the term is used herein, is a polypeptide that 
typically differs from a polypeptide specifically disclosed herein in one or more 
substitutions, deletions, additions and/or insertions. Such variants may be naturally 
occurring or may be synthetically generated, for example, by modifying one or more of 

15 the above polypeptide sequences of the invention and evaluating their immimogenic 
activity as described herein using any of a number of techniques well known in the art. 

For example, certain illustrative variants of the polypeptides of the 
invention include those in which one or more portions, such as an N-terminal leader 
sequence or transmembrane domain, have been removed. Other illustrative variants 

20 include variants in which a small portion (e.g., 1-30 amino acids, preferably 5-15 amino 
acids) has been removed from the N- and/or C-terminal of the mature protein. 

In many instances, a variant will contain conservative substitutions. A 
"conservative substitution" is one in which an amino acid is substituted for another 
amino acid that has similar properties, such that one skilled in the art of peptide 

25 chemistry would expect the secondary stmcture and hydropathic nature of the 
polypeptide to be substantially unchanged. As described above, modifications may be 
made in the structure of the polynucleotides and polypeptides of the present invention 
and still obtain a functional molecule that encodes a variant or derivative polypeptide 
with desirable characteristics, e.g., with immunogenic characteristics. When it is 

30 desired to alter the amino acid sequence of a polypeptide to create an equivalent, or 
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c\cn an improved, iirmiunogenic variant or portion of a po]} peptide of the invention, 
one skilled in the art will t>pically change one or more of the codons of the encoding 
DNA sequence according to Table 1. 

For example, certain amino acids may be substituted for other amino 
5 acids in a protein structure without appreciable loss of interactive binding capacity w ith 
stRictures such as, for example, antigen-binding regions of antibodies or binding sites 
on substrate molecuJes. Since it is the interactive capacity and nature of a protein that 
defines that protein's biological functional activity, certain amino acid sequence 
substitutions can be made in a protein sequence, and, of course, its underlying DNA 
H) coding sequence, and nevertheless obtain a protein with like properties. It is thus 
contemplated that various changes may be made in the peptide sequences of the 
disclosed compositions, or corresponding DNA sequences w^hich encode said peptides 
without appreciable loss of their biological utilit}' or activity. 
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Table 1 



Amino Acids 



Codons 





Ala 


A 


OCA 


GCC 


GCG 


GCU 




Cys 


c 


UGC 


UGU 






/\bp<ii lie dClU 


A cn 


n 


GAC 


GAU 






vJlUldilJlC aL/IU 


V_J 1 IJ 


F 


GAA 


GAG 








Phe 


F 


ULC 


UUU 






Glycine 


vjiy 


n 


GG A 


GGP 

vJ vJ 


GGG 

VJ VJ VJ 


GGU 


Histidine 


rllS 


t T 
i 1 


PAP 

/A V_ 


PA] 1 






Isoleucine 


lie 


1 
1 


AT ] A 


Al IP 


Al n ] 

r\ KJ VJ 




Lysine 


Lys 


K 


AAA 


A AP 






Leucine 


Leu 


T 

L> 


1 11 T A 


1 n IP 

U UVj 


PI 1 A 


PI TP 


Methionine 


iviei 


]V1 


Al IP 








AspB^ragine 


Asn 


IN 


A AP 


AAU 






Proline 


Pro 


P 


CCA 


ccc 


CCG 


ecu 


Glutamine 


Gin 


Q 


CAA 


CAG 






Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


Serine 


Scr 


S 


AGC 


AGU 


UCA 


ucc 


Tlireonine 


Tbr 


T 


ACA 


ACC 


ACG 


ACU 


Valine 


Val 


V 


GUA 


GUC 


GUG 


GUU 


Tryptophan 


Trp 


W 


UGG 








Tyrosine 


Tyr 


Y 


UAC 


UAU 







UCG UCU 



In making such changes, the hydropathic index of amino acids may be 
5 considered. The importance of the hydropathic amino acid index in conferring 
interactive biologic function on a protein is generally understood in the art (Kyte and 
Doolittle, 1982, incorporated herein by reference). It is accepted that the relative 
hydropathic character of the amino acid contributes to the secondary structure of the 
resultant protein, which in turn defines the interaction of the protein v^ith other 
10 molecules, for example, enzymes, substrates, receptors, DNA, antibodies, antigens, and 
the like. Each amino acid has been assigned a hydropathic index on the basis of its 
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li\ drophobicit>' and charge characteristics (Kyte and Doohttle, 19S2). These values are: 
isoleucine (+4.5); valine (-^4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine 
(-r2.5): methionine (+1.9); alanine (+1.8); glycine (-0.4); threonine (-0.7): serine (-0.8); 
iTyptophan (-0.9); tyrosine (-1.3); proline (1.6); histidine (-3.2); glutamate (-3.5); 
5 glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and arginine (-4.5). 

It is known in the art that certain amino acids may be substituted by other 
ajiiino acids having a similar hydropathic index or score and still result in a protein with 
similar biological activity, i,e. still obtain a biological functionally equivalent protein. 
In making such changes, the substitution of amino acids whose hydropathic indices are 
10 within ±2 is preferred, those within +1 are particularly preferred, and those within ±0.5 
are even more particularly preferred. It is also understood in the art that the substitution 
of like amino acids Ccin be made effectively on the basis of hydrophilicity. U. S. Patent 
4,554,10] (specifically incorporated herein by reference in its entirety), stains that the 
greatest local average hydrophilicity of a protein, as governed by the hydrophilicity of 
1 5 its adjacent amino acids, correlates with a biological property of the protein. 

As detailed in U. S. Patent 4,554,101, the following hydrophilicity values 
have been assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate 
i+3.0± 1); glutamate (+3.0 ± 1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); 
glycine (0); tlireonine (-0.4); proline (-0.5 ± 1); alanine (-0.5); histidine (-0.5); cysteine 
20 (-1.0); methionine (-1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); tyTOsine (- 
2.3); phenylalanine (-2.5); tryptophan (-3.4). It is understood that an amino acid can be 
substituted for another having a similar hydrophilicity value and still obtain a 
biologically equivalent, and in particular, an immunologically equivalent protein. In 
such changes, the substitution of amino acids w'hose hydrophilicity values are within ±2 
25 is preferred, those within ±1 are particularly preferred, and those within ±0.5 are even 
more particularly preferred. 

As outlined above, amino acid substitutions aie generally therefore based 
on the relative similarity of the amino acid side-chain substituents, for example, their 
hydrophobicit>s hydrophilicit>% charge, size, and the like. Exemplary substitutions that 
30 take various of the foregoing characteristics into consideration are well known to those 
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of ski]] in the art cind include: arginine and lysine; glutamate and aspartate; serine and 
threonine; glutamine and asparagine; and vaUne, leucine and isoleucine. 

In addition, any polynucleotide may be ftirther modified to increase 
stability in vivo. Possible modifications include, but are not limited to, the addition of 
5 flanking sequences at the 5' and/or 3' ends; the use of phosphorothioate or 2' O-methyl 
rather than phosphodiesterase linkages in the backbone; and/or the inclusion of 
nontraditional bases such as inosine, queosine and wybutosine, as well as acetyl- 
methyl-, thio- and other modified forms of adenine, cytidine, guanine, thymine and 
uridine. 

10 Amino acid substitutions may further be made on the basis of similarity 

in polarity, charge, solubility, hydrophobicity, hydrophilicity and/or the amphipathic 
nature of the residues. For example, negatively charged amino acids include aspartic 
acid and glutamic acid; positively charged amino acids include lysine and arginine; and 
amino acids whh uncharged polar head groups having similar hydrophilicity values 

15 include leucine, isoleucine and valine; glycine and alanine; asparagine and glutamine; 
and serine, threonine, phenylalanine and tyrosine. Other groups of amino acids that may 
represent conservative changes include: (l)ala, pro, gly, glu, asp, gin, asn, ser, thr; 
(2) cys, ser, tyr, tlir; (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phe, tyr, trp, 
his. A variant may also, or alternatively, contain nonconservative changes. In a 

20 prefened embodiment, variant polypeptides differ from a native sequence by 
substitution, deletion or addition of five amino acids or fewer. Variants may also (or 
alternatively) be modified by, for example, the deletion or addition of amino acids that 
have minimal influence on the immunogenicity, secondary structure and hydropathic 
nature of the polypeptide. 

25 As noted above, polypeptides may comprise a signal (or leader) sequence 

at the N-terminal end of the protein, which co-translationally or post-translationally 
directs transfer of the protein. The polypeptide may also be conjugated to a linker or 
other sequence for ease of synthesis, purification or identification of the polypeptide 
(e.g., poly-His), or to enhance binding of the polypeptide to a solid support. For 

30 example, a polypeptide may be conjugated to an immunoglobulin Fc region. 
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WTien comparing polypeptide sequences, two sequences are said to be 
'•identical*' if the sequence of amino acids in the two sequences is the same when 
aligned for maximum correspondence, as described below. Comparisons between two 
sequences are typically perfonned by comparing the sequences over a comparison 
5 window to identify and compare local regions of sequence similarity. A "comparison 
window'' as used herein, refers to a segment of at least about 20 contiguous positions, 
usually 30 to about 75, 40 to about 50, in w^hich a sequence may be compared to a 
reference sequence of the same number of contiguous positions after the two sequences 
are optimally aligned. 

10 Optimal alignment of sequences for comparison may be conducted using 

the Megalign program in the Lasergene suite of bioinfonnatics software (DNASTAR, 
Inc., Madison, WI), using default parameters. This program embodies several 
alignment schemes described in the following references: Dayhoff, M.O. (1978) A 
model of evolutionary change in proteins - Matrices for detecting distant relationships. 

15 In Dayhoff, M.O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical 
Research Foundation, Washington DC Vol. 5, SuppL 3, pp. 345-358; Hein J. (1990) 
Unified Approach to Alignment and Phylogenes pp. 626-645 Methods in Enzyinology 
vol. 183, Academic Press, Inc., San Diego, CA; Higgins, D.G. and Shaip, P.M. (1989) 
CABIOS 5:\5]-\53; Myers, E.W. and Muller W. (1988) CABIOS 4:UA1\ Robinson, 

20 E.D. (1971) Comb, Theor 77:105; Santou, N. Nes, M. (1987) Mol BioL Evol 4'AQ6- 
425; Sneath, P.H.A. and Sokal, R.R. (1973) Numerical Taxonomy - the Principles and 
Practice of Numerical Taxonomy, Freeman Press, San Francisco, CA; Wilbur, WJ. and 
Lipman, D.J. (1983) Proc. Nail Acad, Sci. USA 80:726-730. 

Alternatively, optimal alignment of sequences for comparison may be 

25 conducted by the local identity algorithm of Smith and Waterman (1981) Add. APL. 
Math 2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) J. 
Mol. Biol. 48:443, by the search for similarity methods of Pearson and Lipman (1988) 
Proc Natl Acad. ScL USA 85: 2444, by computerized implementations of these 
algorithms (GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics 
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Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, WI), 
or by inspection. 

One preferred example of algorithms that are suitable for determining 
percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 
5 algorithms, which are described in Altschul et al. (1 977) Nucl Acids Res, 25:3389-3402 
and Altschul et al. (1990) J. Mol BioL 215:403-410, respectively. BLAST and BLAST 
2.0 can be used, for example with the parameters described herein, to determine percent 
sequence identity for the polynucleotides and polypeptides of the invention. Software 
for performing BLAST analyses is publicly available through the National Center for 

10 Biotechnology Information. For amino acid sequences, a scoring matrix can be used to 
calculate the cumulative score. Extension of the word hits in each direction are halted 
when: the cumulative alignment score falls off by the quantity X from its maximum 
achieved value; the cumulative score goes to zero or below, due to the accumulation of 
one or more negative-scoring residue alignments; or the end of either sequence is 

1 5 reached. The BLAST algorithm parameters W, T and X determine the sensitivity and 
speed of the alignment. 

In one preferred approach, the "percentage of sequence identity" is 
determined by comparing two optimally aligned sequences over a window of 
comparison of at least 20 positions, wherein the portion of the polypeptide sequence in 

20 the comparison window may comprise additions or deletions {Le., gaps) of 20 percent 
or less, usually 5 to 15 percent, or 10 to 12 percent, as compared to the reference 
sequences (which does not comprise additions or deletions) for optimal alignment of the 
two sequences. The percentage is calculated by determining the number of positions at 
which the identical amino acid residue occurs in both sequences to yield the number of 

25 matched positions, dividing the number of matched positions by the total mmiber of 
positions in the reference sequence {i.e., the window size) and multiplying the results by 
100 to yield the percentage of sequence identity. 

Within other illustrative embodiments, a polypeptide may be a fusion 
polypeptide that comprises multiple polypeptides as described herein, or that comprises 

30 at least one polypeptide as described herein and an unrelated sequence, such as a known 
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tumor protein. A fusion partner may, for example, assist in providing T helper epitopes 
(an immunological fusion partner;, preferably T helper epitopes recognized by humans, 
or may assist in expressing the protein (an expression enhancer) at higher yields than the 
native recombinant protein. Certain preferred fiision partners arc both immunological 
5 and expression enhancing fusion partners. Other flision partners may be selected so as 
to increase the solubility of the polypeptide or to enable the polypeptide to be targeted to 
desired intracellular compartments. Still further fusion partners include affinity tags, 
which facilitate purification of the polypeptide. 

Fusion polypeptides may generally be prepared using standard 
0 techniques, including chemical conjugation. Preferably, a fusion polypeptide is 
expressed as a recombinant polypeptide, allowing the production of increased levels, 
relative to a non-fused polypeptide, in an expression system. Briefly, DNA sequences 
encoding the polypeptide components may be assembled separately, and ligated into an 
appropriate expression vector. Ihe 3' end of the DNA sequence encoding one 
5 polypeptide component is ligated, with or without a peptide linker, to the 5' end of a 
DNA sequence encoding the second polypeptide component so that the reading frames 
of the sequences are in phase. This permits translation into a single fusion polypeptide 
that retains the biological activity of both component polypeptides. 

A peptide linker sequence may be employed to separate the first and 
0 second polypeptide components by a distance sufficient to ensure that each polypeptide 
folds into its secondary and tertiary structures. Such a peptide linker sequence is 
incorporated into the fusion polypepfide using standard techniques well known in the 
art. Suitable peptide linker sequences may be chosen based on the following factors: 
(1) their ability to adopt a flexible extended conformation; (2) their inability to adopt a 
secondary structure that could interact with functional epitopes on the first and second 
polypeptides; and (3) the lack of hydrophobic or charged residues that might react with 
the polypeptide functional epitopes. Preferred peptide linker sequences contain Gly, 
Asn and Ser residues. Other near neutral amino acids, such as Thr and Ala may also be 
used in the linker sequence. Amino acid sequences which may be usefully employed as 
linkers include those disclosed in Maratea et al.. Gene 40:39-46, 1985; Muiphy etal.. 
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Proc, Natl Acad ScL USA «:8258-8262, 1986; U.S. Patent No. 4,935,233 and U.S. 
Patent No. 4,751,180. The linker sequence may generally be from 1 to about 50 amino 
acids in length. Linker sequences are not required when the first and second 
polypeptides have non-essential N-terminal amino acid regions that can be used to 

5 separate the functional domains and prevent steric interference. 

The ligated DNA sequences are operably linked to suitable 
transcriptional or translational regulatory elements. The regulatory elements 
responsible for expression of DNA are located only 5' to the DNA sequence encoding 
the first polypeptides. Similarly, stop codons required to end translation and 

10 transcription termination signals are only present 3' to the DNA sequence encoding the 
second polypeptide. 

The fusion polypeptide can comprise a polypeptide as described herein 
together with an unrelated immunogenic protein, such as an immunogenic protein 
capable of eliciting a recall response. Examples of such proteins include tetanus, 

15 tuberculosis and hepatitis proteins {see, for example, Stoute et al. New Engl J, Med., 
33d:86-91, 1997). 

In one preferred embodiment, the immunological fusion partner is 
derived from a Mycobacterium sp., such as ?i Mycobacterium tuberculosis-d^xxwcd Ral2 
fragment. Ral2 compositions and methods for their use in enhancing the expression 

20 and/or immunogenicity of heterologous polynucleotide/polypeptide sequences is 
described in U.S. Patent Application 60/158,585, the disclosure of which is 
incorporated herein by reference in its entirety. Briefly, Ral2 refers to a polynucleotide 
region that is a subsequence of a Mycobacterium tuberculosis MTB32A nucleic acid. 
MTB32A is a serine protease of 32 KD molecular weight encoded by a gene in virulent 

25 and avirulent strains of M tuberculosis. The nucleotide sequence and amino acid 
sequence of MTB32A have been described (for example, U.S. Patent Application 
60/158,585; see also, Skeiky et al. Infection and Immun (1999) 67:3998-4007, 
incorporated herein by reference). C-terminal fragments of the MTB32A coding 
sequence express at high levels and remain as a soluble polypeptides throughout the 

30 purification process. Moreover, Ral2 may enhance the immunogenicity of heterologous 
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iimnunogenic polypeptides with which it is fused. One preferred Ral2 fusion 
polypeptide comprises a ]4 KD C-terminal fragment corresponding to amino acid 
residues 192 to 323 of MTB32A. Other preferred Ral2 polynucleotides generally 
comprise at least about 15 consecutive nucleotides, at least about 30 nucleotides, at least 
5 about 60 nucleotides, at least about 100 nucleotides, at least about 200 nucleotides, or at 
least about 300 nucleotides that encode a portion of a Ral2 polypeptide. Ral2 
polynucleotides may comprise a native sequence (i.e., an endogenous sequence that 
encodes a Ral2 polypeptide or a portion thereof) or may comprise a variant of such a 
sequence. Ral2 polynucleotide variants may contain one or more substitutions, 
0 additions, deletions and/or insertions such that the biological activity of the encoded 
fusion polypeptide is not substantially diminished, relative to a fusion polypeptide 
comprising a native Ral2 polypeptide. Variants preferably exhibit at least about 70% 
identity, more preferably at least about 80% identity and most preferably at least about 
90% identity to a polynucleotide sequence that encodes a native Ral2 polypeptide or a 
5 portion thereof. 

Within other preferred embodiments, an immunological fusion partner is 
derived from protein D, a surface protein of the gram-negative bacterium Haemophilus 
mfluenza B (WO 91/18926). Preferably, a protein D derivative comprises 
approximately the first third of the protein (e.g., the first N-terminal 100-110 amino 
0 acids), and a protein D derivative may be lipidated. Within certain preferred 
embodiments, the first 109 residues of a Lipoprotein D fusion partner is included on the 
N-terminus to provide the polypeptide w^ith additional exogenous T-cell epitopes and to 
increase the expression level in E, coli (thus functioning as an expression enhancer). - 
I he lipid tail ensures optimal presentation of the antigen to antigen presenting cells. 
Other fusion partners include the non-structural protein from influenzae virus, NSl 
(hemaglutinin). Typically, the N-terminal 81 amino acids are used, although different 
fragments that include T-helper epitopes may be used. 

In another embodiment, the inmiunological fusion partner is the protein 
known as LYTA, or a portion thereof (preferably a C-tenninal portion). LYTA is 
derived from Sti'eptococcus pneumoniae, w^hich synthesizes an N-acety4-L-aIanine 
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amidase known as amidase LYTA (encoded by the LytA gene; Gene -/5:265-292, 1986). 
LYTA is an autolysin that specifically degrades certain bonds in the peptidoglycan 
backbone. The C-terminal domain of the LYTA protein is responsible for the affinity to 
the choline or to some choline analogues such as DEAE. This property has been 
5 exploited for the development of E. coU C-LYTA expressing plasmids useful for 
expression of fusion proteins. Purification of hybrid proteins containing the C-LYTA 
fragment at the amino terminus has been described {see Biotechnology 70:795-798, 
1992). Within a preferred embodiment, a repeat portion of LYTA may be incorporated 
into a ftision polypeptide. A repeat portion is found in the C-terminal region starting at 

10 residue 178. A particularly preferred repeat portion incorporates residues 188-305. 

Yet another illustrative embodiment involves fusion polypeptides, and 
the polynucleotides encoding them, wherein the fusion partner comprises a targeting 
signal capable of directing a polypeptide to the endosomal/lysosomal compartment, as 
described in U.S. Patent No. 5,633,234. An inununogenic polypeptide of the invention, 

15 when fused with this targeting signal, will associate more efficiently with MHC class II 
molecules and thereby provide enhanced in vivo stimulation of CD4^ T-cells specific 
for the polypeptide. 

Polypeptides of the invention are prepared using any of a variety of well 
known synthetic and/or recombinant techniques, the latter of which are further 

20 described below. Polypeptides, portions and other variants generally less than about 
1 50 amino acids can be generated by synthetic means, using techniques well known to 
those of ordinary skill in the art. In one illustrative example, such polypeptides are 
synthesized using any of the commercially available solid-phase techniques, such as the 
Merrifield solid-phase synthesis method, where amino acids are sequentially added to a 

25 growing amino acid chain. See Merrifield, J. Am, Chew, Sac. 55:2149-2146, 1963. 
Equipment for automated synthesis of polypeptides is commercially available from 
suppliers such as Perkin Elmer/Applied BioSystems Division (Foster City, CA), and 
may be operated according to the manufacturer's instructions. 

In general, polypeptide compositions (including fusion polypeptides) of 

30 the invention are isolated. An "isolated" polypeptide is one that is removed from its 
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onuinal enviromiient. For example, a naturally-occurring protein or polypeptide is 
isolated if it is separated from some or all of the coexisting materials in the natairal 
system. Preferably, such polypeptides are also purified, e.g.. are at least about 90% 
pure, more preferably at least about 95% pure and most preferably at least about 99% 
5 pure. 

I^LllAinuclct^tiJ e Com positions 

The present invention, in other aspects, provides polynucleotide 
compv^sinuns, fhe terms "DNA" and "polynucleotide" are used essentially 
]nterchanL!cjhl> herein to refer to a DNA molecule that has been isolated free of total 
10 genomic DNA of a particular species. "Isolated," as used herein, means that a 
po!\ nucleotide is substantially away from other coding sequences, and that the DNA 
molecule docs not contain large portions of unrelated coding DNA, such as large 
chiomosomal fragments or other functional genes or polypeptide coding regions. Of 
course, this refers to the DNA molecule as originally isolated, and does not exclude 
1 5 genes or coding regions later added to the segment by the hand of man. 

As will be understood by those skilled in the art, the polynucleotide 
compositions of this invention can include genomic sequences, extra-genomic and 
plasmid-encoded sequences and smaller engineered gene segments that express, or may 
be adapted to express, proteins, polypeptides, peptides and the like. Such segments may 
20 be naturally isolated, or modified synthetically by the hand of man. 

As will be also recognized by the skilled artisan, polynucleotides of the 
invention may be single-stranded (coding or antisense) or double-stranded, and may be 
DNA (genomic. cDNA or synthetic) or RNA molecules. RNA molecules may include 
IhiRNA molecules, which contain introns and correspond to a DNA molecule in a one- 
25 to-one manner, and mRNA molecules, which do not contain introns. Additional coding 
or non-coding sequences may, but need not, be present within a polynucleotide of the 
present invention, and a polynucleotide may, but need not, be linked to other molecules 
and/or support materials. 
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Polynucleotides may comprise a native sequence {i,e,, an endogenous 
sequence that encodes a polypeptide/protein of the invention or a portion thereof) or 
may comprise a sequence that encodes a variant or derivative, preferably an 
immunogenic variant or derivative, of such a sequence. 
5 Therefore, according to another aspect of the present invention, 

polynucleotide compositions are provided that comprise some or all of a polynucleotide 
sequence set forth in any one of SEQ ID NOs: 1-111, 115-171, 173-175, 177, 179-305, 
307-315, 326, 328, 330, 332-335, 340-375, 381, 382 and 384-476, 524, 526, 530, 531, 
533, 535, 536, 552, 569-572, 587, 591, 593-606, 618-626, 630, 631, 634, 636, 639-655, 

10 674, 680, 681, 711, 713, 716, 720-722, 735, 737-739, 751, 753, 764, 765, 773-776 and 
786-788, complements of a polynucleotide sequence set forth in any one of SEQ ID 
NOs: 1-111, 115-171, 173-175, 177, 179-305, 307-315, 326, 328, 330, 332-335, 340- 
375, 381, 382 and 384-476, 524, 526, 530, 531, 533, 535, 536, 552, 569-572, 587, 591, 
593-606, 618-626, 630, 631, 634, 636, 639-655, 674, 680, 681, 711, 713, 716, 720-722, 

15 735, 737-739, 751, 753, 764, 765, 773-776 and 786-788, and degenerate variants of a 
polynucleotide sequence set forth in any one of SEQ ID NOs: 1-111, 115-171, 173-175, 
177, 179-305, 307-315, 326, 328, 330, 332-335, 340-375, 381, 382 and 384-476, 524, 
526, 530, 531, 533, 535, 536, 552, 569-572, 587, 591, 593-606, 618-626, 630, 631, 634, 
636, 639-655, 674, 680, 681, 711, 713, 716, 720-722, 735, 737-739, 751, 753, 764, 765, 

20 773-776 and 786-788. In certain preferred embodiments, the polynucleotide sequences 
set forth herein encode immunogenic polypeptides, as described above. 

In other related embodiments, the present invention provides 
polynucleotide variants having substantial identity to the sequences disclosed herein in 
SEQ ID NOs: 1-111, 115-171, 173-175, 177, 179-305, 307-315, 326, 328, 330, 332- 

25 335, 340-375, 381, 382 and 384-476, 524, 526, 530, 531, 533, 535, 536, 552, 569-572, 
587, 591, 593-606, 618-626, 630, 631, 634, 636, 639-655, 674, 680, 681, 71 1, 713, 716, 
720-722, 735, 737-739, 751, 753, 764, 765, 773-776 and 786-788, for example those 
comprising at least 70% sequence identity, preferably at least 75%, 80%, 85%, 90%, 
95%, 96%>, 97%, 98%, or 99% or higher, sequence identity compared to a 

30 polynucleotide sequence of this invention using the methods described herein, (e.g.. 
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BLAST analysis using standard parameters, as described below). One skilled in this art 
will recognize that these values can be appropriately adjusted to detemrine 
corresponding identit>^ of proteins encoded by two nucleotide sequences by taking into 
account codon degeneracy, amino acid similarity, reading frame positioning and the 
5 like. 

Typically, polynucleotide variants will contain one or more substitutions, 
additions, deletions and/or insertions, preferably such that the immunogenicity of the 
polypeptide encoded by the variant polynucleotide is not substantially diminished 
relative to a polypeptide encoded by a polynucleotide sequence specifically set forth 
] 0 herein). The term "variants" should also be understood to encompasses homologous 
genes of xenogenic origin. 

In additional embodiments, the present invention provides 
polynucleotide fragments comprising various lengths of contiguous stretches of 
sequence identical to, or complementary^ to, one or more of the sequences disclosed 

1 5 herein. For example, polynucleotides are provided by this invention that comprise at 
least about 10, 15, 20, 30, 40, 50, 75, 100, 150, 200, 300, 400, 500 or 1000 or more 
contiguous nucleotides of one or more of the sequences disclosed herein as well as all 
intennediate lengths there between. It will be readily understood that "intermediate 
lengths", in this context, means any length between the quoted values, such as 16, 17. 

20 18, 19, etai 21, 22, 23, etc; 30, 31, 32, etc; 50, 51, 52, 53, etc; 100, 101, 102, 103, 
etc.\ 150, 151, 152, 153, etc,\ including all integers through 200-500; 500-1,000, and the 
like. 

In another embodiment of the invention, polynucleotide compositions are 
provided that are capable of hybridizing under moderate to high stringency conditions to 

25 a polynucleotide sequence provided herein, or a fragment thereof, or a complementary 
sequence thereof. Hybridization techniques are well known in the art of molecular 
biology. For purposes of illustration, suitable moderately stringent conditions for 
testing the hybridization of a polynucleotide of this invention with other polynucleotides 
include prewashing in a solution of 5 X SSC, 0.5% SDS, TO mM EDTA (pH 8.0); 

30 hybridizing at 50X-60^C, 5 X SSC. overnight; follow^ed by washing twice at 65^C for 
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20 minutes with each of 2X, 0.5X and 0,2X SSC containing 0.1% SDS. One skilled in 
the an will understand that the stringency of hybridization can be readily manipulated, 
such as by altering the salt content of the hybridization solution and/or the temperature 
at which the hybridization is performed. For example, in another embodiment, suitable 
5 highly stringent hybridization conditions include those described above, with the 
exception that the temperature of hybridization is increased, e.g., to 60-65^C or 65- 
70T. 

In certain preferred embodiments, the polynucleotides described above, 
e.g., p<.>I\ nucleotide variants, fragments and hybridizing sequences, encode polypeptides 

10 that are inimunologically cross-reactive with a polypeptide sequence specifically set 
foith herein. In other preferred embodiments, such polynucleotides encode 
polypeptides thai have a level of immunogenic activity of at least about 50%, preferably 
at least about 70'^o, and more preferably at least about 90% of that for a polypeptide 
sequence specifically set forth herein. 

15 The polynucleotides of the present invention, or fragments thereof, 

regardless of the length of the coding sequence itself, may be combined with other DNA 
sequences, such as promoters, polyadenylation signals, additional restriction enzyme 
sites, multiple cloning sites, other coding segments, and the like, such that their overall 
length may vary considerably. It is therefore contemplated that a nucleic acid fi*agment 

20 of almost any length may be employed, with the total length preferably being limited by 
the ease of preparation and use in the intended recombinant DNA protocol. For 
example, illustrative polynucleotide segments with total lengths of about 10,000, about 
5000, about 3000, about 2,000, about 1,000, about 500, about 200, about 100, about 50 
base pairs in length, and the like, (including all intermediate lengths) are contemplated 

25 to be useful in many implementations of this invention. 

When comparing polynucleotide sequences, two sequences are said to be 
"identical" if the sequence of nucleotides in the two sequences is the same when aligned 
for maximum correspondence, as described below. Comparisons between two 
sequences are typically performed by comparing the sequences over a comparison 

30 window to identify and compare local regions of sequence similarity. A "comparison 
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window" as used herein, refers to a segment of at least about 20 contiguous positions, 
usually 30 to about 75. preferably 40 to about 50, in which a sequence may be compared 
to a reference sequence of the same number of contiguous positions after the two 
sequences are optimally aligned. 

Optima] alignment of sequences for comparison may be conducted using 
the Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, 
Inc., Madison, WI), using default parameters. This program embodies several 
alignjiient schemes described in the following references: Dayhoff, M.O. (1978) A 
model of evolutionar>^ change in proteins - Matrices for detecting distant relationships. 
0 In Dayhoff. M.O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical 
Research Foundation, Washington DC Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990) 
L/niried Approach to Alignment and Phylogenes pp. 626-645 Methods in Euzynnology^ 
vol. 183, Academic Press, Inc.. San Diego, CA; Higgins, D.G. and Sharp, P.M. (1989) 
CABIOS 5:151-153; Myers, E.W. and MuUer W. (1988) CABIOS ^:11-17; Robinson, 
5 h.D. (1971) Co?nb, Theor Il:\05\ Santou, N. Nes, M. (1987) AdoL Biol. EvoL 4'AQ6- 
425; Sneath, P.H.A. and Sokal, R.R. (1973) Numerical Taxonomy - the Principles and 
Practice of Numerical Taxonomy, Freeman Press, San Francisco, CA; Wilbur, W.J. and 
Fipman, D.J. {\9^?>)Proc. Natl. Acad, ScL USA 50:726-730. 

Alternatively, optima] alignment of sequences for comparison may be 
) conducted by the local identity algorithm of Smith and Waterman (1981) Add. API. 
Math 2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) j: 
Mol BioL 48:443, by the search for similarity methods of Pearson and Lipman (1988) 
Proc. Natl Acad. Sci. USA 85: 2444, by computerized implementations of these 
algorithms (GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics 
> Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, ^\a), 
or by inspection. 

One preferred example of algorithms that are suitable for determining 
percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 
algorithins, which are described in AJtschul et al. (1977) Nucl. Acids Res. 25:3389-3402 
* and Altschul et al. (1990) J. Mol BioL 215:403-410, respectively. BLAST and BLAST 
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2.0 can be used, lor example with the parameters described herein, to determine percent 
sequence identity for the polynucleotides of the invention. Software for performing 
BLAST analyses is publicly available through the National Center for Biotechnology 
Information. In one illustrative example, cumulative scores can be calculated using, for 
5 nucleotide sequences, the parameters M (reward score for a pair of matching residues; 
always >0) and N (penalty score for mismatching residues; always <0). Extension of 
the word hits in each direction are halted when: the cumulative alignment score falls off 
by the quantity X from its maximum achieved value; the cumulative score goes to zero 
or below, due to the accumulation of one or more negative-scoring residue alignments; 

10 or the end of either sequence is reached. The BLAST algorithm parameters W, T and X 
detenuine the sensitivity and speed of the alignment. The BLASTN program (for 
nucleotide sequences) uses as defaults a wordlength (W) of 1 1 , and expectation (E) of 
10, and the BLOSUM62 scoring matrix (see Henikoff and Henikoff (1989) Proc. Natl 
Acad. Sci. USA 89:10915) alignments, (B) of 50, expectation (E) of 10, M-5, N--4 and 

1 5 a comparison of both strands. 

Preferably, the "percentage of sequence identity" is determined by 
comparing two optimally aligned sequences over a window of comparison of at least 20 
positions, wherein the portion of the polynucleotide sequence in the comparison 
window may comprise additions or deletions {Le,, gaps) of 20 percent or less, usually 5 

20 to 15 percent, or 10 to 12 percent, as compared to the reference sequences (which does 
not comprise additions or deletions) for optimal alignment of the two sequences. The 
percentage is calculated by determining the number of positions at which the identical 
nucleic acid bases occurs in both sequences to yield the number of matched positions, 
dividing the number of matched positions by the total number of positions in the 

25 reference sequence {i.e., the window size) and multiplying the resuhs by 100 to yield the 
percentage of sequence identity. 

It will be appreciated by those of ordinary skill in the art that, as a result 
of the degeneracy of the genetic code, there are many nucleotide sequences that encode 
a polypeptide as described herein. Some of these polynucleotides bear minimal 

30 homology to the nucleotide sequence of any native gene. Nonetheless, polynucleotides 
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that van- due to differences in codon usage are specificallx contempJated by the present 
invention. Further, alleles of the genes comprising the polynucleotide sequences 
provided herein are within the scope of the present invention. Alleles are endogenous 
genes that are altered as a result of one or more mutations, such as deletions, additions 
5 and/or substitutions of nucleotides. The resulting mRNA and protein may, but need not, 
have an altered structure or function. Alleles may be identified using standard 
techniques (such as hybridization, amplification and/or database sequence comparison). 

Therefore, in another embodiment of the invention, a mutagenesis 
approach, such as site-specific mutagenesis, is employed for the preparation of 
10 immunogenic variants and/or derivatives of the polypeptides described herein. By this 
approach, specific modifications in a polypeptide sequence can be made through 
mutagenesis of the underlying polynucleotides that encode them. These techniques 
provides a straightforward approach to prepare and test sequence variants, for example, 
incorporating one or more of the foregoing considerations, by introducing one or more 
1 5 nucleotide sequence changes into the polynucleotide. 

Site-specific mutagenesis allows the production of mutants through the 
use of specific oligonucleotide sequences which encode the DNA sequence of the 
desired mutation, as we]] as a sufficient number of adjacent nucleotides, to provide a 
primer sequence of sufficient size and sequence complexity to form a stable duplex on 
20 both sides of the deletion junction being traversed. Mutations may be employed in a 
selected polynucleotide sequence to improve, alter, decrease, modify, or otherwise 
change the properties of the polynucleotide itself, and/or alter the properties, activity, 
composition, stability, or primary sequence of the encoded polypeptide. 

In certain embodiments of the present invention, the inventors 
25 contemplate the mutagenesis of the disclosed polynucleotide sequences to alter one or 
more properties of the encoded polypeptide, such as the immunogenicity of a 
polypeptide vaccine. The techniques of site-specific mutagenesis are well-knowTi in the 
art, and are widely used to create variants of both polypeptides and polynucleotides. For 
example, site-specific mutagenesis is often used to alter a specific portion of a DNA 
30 molecule. In such embodiments, a primer comprising typically about 14 to about 25 
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nucleotides or so in length is employed, with about 5 to about 10 residues on both sides 
of the junction of the sequence being altered. 

As will be appreciated by those of skill in the art, site-specific 
mutagenesis techniques have often employed a phage vector that exists in both a single 
5 stranded and double stranded fomi. Typical vectors useful in site-directed mutagenesis 
include vectors such as the Ml 3 phage. These phage are readily commercially-available 
and their use is generally well-knoAvn to those skilled in the art. Double-stranded 
plasmids are also routinely employed in site directed mutagenesis that eliminates the 
step of transferring the gene of interest from a plasmid to a phage. 

10 In general, site-directed mutagenesis in accordance herewith is 

performed by first obtaining a single-stranded vector or melting apart of two strands of a 
double-stranded vector that includes within its sequence a DNA sequence that encodes 
the desired peptide. An oligonucleotide primer bearing the desired mutated sequence is 
prepared, generally synthetically. This primer is then annealed with the single-stranded 

15 vector, and subjected to DNA polymerizing enzymes such as E. coli polymerase I 
Klenow fragment, in order to complete the synthesis of the mutation-bearing strand. 
Thus, a heteroduplex is formed wherein one strand encodes the original non-mutated 
sequence and the second strand bears the desired mutation. This heteroduplex vector is 
then used to transform appropriate cells, such as E. coli cells, and clones are selected 

20 which include recombinant vectors bearing the mutated sequence arrangement. 

The preparation of sequence variants of the selected peptide-encoding 
DNA segments using site-directed mutagenesis provides a means of producing 
potentially useful species and is not meant to be limiting as there are other ways in 
which sequence variants of peptides and the DNA sequences encoding them may be 

25 obtained. For example, recombinant vectors encoding the desired peptide sequence 
may be treated with mutagenic agents, such as hydroxylamine, to obtain sequence 
variants. Specific details regarding these methods and protocols are found in the 
teachings of Maloy et al, 1994; Segal, 1976; Prokop and Bajpai, 1991; Kuby, 1994; and 
Maniatis et al , 1 982, each incorporated herein by reference, for that purpose. 
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As used herein, the tcnn ^'oligonucleotide directed mutagenesis 
procedure" refers to template-dependent processes and vector-mediated propagation 
which result in an increase in the concentration of a specific nucleic acid molecule 
relative to its initial concentration, or in an increase in the concentration of a detectable 
5 signal, such as amplification. As used herein, the term ''oligonucleotide directed 
mutagenesis procedure" is intended to refer to a process that involves the 
template-dependent extension of a primer molecule. The term template dependent 
process refers to nucleic acid synthesis of an RNA or a DNA molecule wherein the 
sequence of the newly synthesized strand of nucleic acid is dictated by the well-knowTi 
0 rules of complementary base pairing (see, for example, Watson, 1987). Typically, 
vector mediated methodologies involve the introduction of the nucleic acid fragment 
into a DNA or RNA vector the clonal amplification of the vector, and the recovery of 
the amplified nucleic acid fragment. Examples of such methodologies are provided by 
U. S. Patent No. 4,237,224, specifically incorporated herein by reference in its entirety. 

In another approach for the production of polypeptide variants of the 
present invention, recursive sequence recombination, as described in U.S. Patent No. 
5,837,458, may be employed. In this approach, iterative cycles of recombination and 
screening or selection are performed to "evolve" individual polynucleotide variants of 
the invention having, for example, enhanced immunogenic activity. 

In other embodiments of the present invention, the polynucleotide 
sequences provided herein can be advantageously used as probes or primers for nucleic 
acid hybridization. As such, it is contemplated that nucleic acid segments that comprise 
a sequence region of at least about 15 contiguous nucleotides that has the same 
sequence as, or is complementary^ to, a 15 nucleotide long contiguous sequence 
disclosed herein will find particular utility. Longer configuous identical or 
complementary sequences, e.g., those of about 20, 30, 40, 50, 100, 200, 500, 1000 
(including all intermediate lengths) and even up to full length sequences will also be of 
use in certain embodiments. 

The abi]it>^ of such nucleic acid probes to specifically hybridize to a 
sequence of interest will enable them to be of use in detecting the presence of 
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complementary sequences in a given sample. However, other uses are also envisioned, 
such as the use of the sequence information for the preparation of mutant species 
primers, or primers for use in preparing other genetic constructions. 

Polynucleotide molecules having sequence regions consisting of 
5 contiguous nucleotide stretches of 10-14, 15-20, 30, 50, or even of 100-200 nucleotides 
or so (including intermediate lengths as v^ell), identical or complementary to a 
polynucleotide sequence disclosed herein, are particularly contemplated as hybridization 
probes for use in, e.g.. Southern and Northern blotting. This would allow a gene 
product, or fragment thereof, to be analyzed, both in diverse cell types and also in 

10 various bacterial cells. The total size of fragment, as well as the size of the 
complementary stretch(es), will ultimately depend on the intended use or application of 
the particular nucleic acid segment. Smaller fragments will generally find use in 
hybridization embodiments, wherein the length of the contiguous complementary region 
may be varied, such as between about 15 and about 100 nucleotides, but larger 

15 contiguous complementarity stretches may be used, according to the length 
complementaiy sequences one wishes to detect. 

The use of a hybridization probe of about 15-25 nucleotides in length 
allows the formation of a duplex molecule that is both stable and selective. Molecules 
having contiguous complementary sequences over stretches greater than 15 bases in 

20 length are generally preferred, though, in order to increase stability and selectivity of the 
hybrid, and thereby improve the quality and degree of specific hybrid molecules 
obtained. One will generally prefer to design nucleic acid molecules having gene- 
complementary stretches of 15 to 25 contiguous nucleotides, or even longer where 
desired. 

25 Hybridization probes may be selected from any portion of any of the 

sequences disclosed herein. All that is required is to review the sequences set forth 
herein, or to any continuous portion of the sequences, from about 15-25 nucleotides in 
length up to and including the full length sequence, that one wishes to utilize as a probe 
or primer. The choice of probe and primer sequences may be governed by various 
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factors. For example, one may wish to employ primers from towards the tennini of the 
K)tal sequence. 

Small poh nucleotide segments or fragments may be readily prepared by, 
for example, directly synthesizing the fragment by chemical means, as is commonly 
5 practiced using an automated oligonucleotide synthesizer. AJso, fragments may be 
obtained by application of nucleic acid reproduction technology, such as the PGR™ 
technology of U. S. Patent 4,683,202 (incorporated herein by reference), by introducing 
selected sequences into recombinant vectors for recombinant production, and by other 
recombinant DNA techniques generally known to those of skill in the art of molecular 
10 biology. 

llie nucleotide sequences of the invention may be used for their ability to 
selectively form duplex molecules with complementar>^ stretches of the entire gene or 
gene fragments of interest. Depending on the application envisioned, one will tN^pically 
desire to employ varying conditions of hybridization to achieve var>ing degrees of 

5 selectivity of probe towards target sequence. For applications requiring high selectivity, 
one will typically desire to employ relatively stringent conditions to form the hybrids, 
e.g., one will select relatively low salt and/or high temperature conditions, such as 
provided by a salt concentration of from about 0.02 M to about 0.15 M salt at 
temperatures of from about 50°C to about 70°C. Such selective conditions tolerate 

:0 httle, if any, mismatch between the probe and the template or target strand, and would 
be particularly suitable for isolating related sequences. 

Of course, for some applications, for example, where one desires to 
prepare mutants employing a mutant primer strand hybridized to an underlying 
template, less stringent (reduced stringency) hybridization conditions will typically be 

5 needed in order to allow formation of the heteroduplex. In these circumstances, one 
may desire to employ salt conditions such as those of from about 0.15 M to about 0.9 M 
salt, at temperatures ranging from about 20''C to about 55''C. Cross-hybridizing species 
can thereby be readily identified as positively hybridizing signals with respect to control 
hybridizations. In any case, it is generally appreciated that conditions can be rendered 

0 more stringent by the addition of increasing amounts of formamide, which serves to 
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destabilize the hybrid duplex in the same manner as increased temperature. Thus, 
hybridization conditions can be readily manipulated, and thus will generally be a 
method of choice depending on the desired results. 

According to another embodiment of the present invention, 
5 polynucleotide compositions comprising antisense oligonucleotides are provided. 
Antisense oligonucleotides have been demonstrated to be effective and targeted 
inhibitors of protein synthesis, and, consequently, provide a therapeutic approach by 
which a disease can be treated by inhibiting the synthesis of proteins that contribute to 
the disease. The efficacy of antisense oligonucleotides for inhibiting protein synthesis 

10 is well established. For example, the synthesis of polygalactaxironase and the muscarine 
type 2 acetylcholine receptor are inhibited by antisense oligonucleoddes directed to their 
respective mRNA sequences (U. S. Patent 5,739,119 and U. S. Patent 5,759,829). 
Further, examples of antisense inhibition have been demonstrated with the nuclear 
protein, cyclin, the multiple drug resistance gene (MDGl), ICAM-l, E-selectin, STK-1, 

15 striatal GABAa receptor and human EOF (Jaskulski et al. Science. 1988 Jun 
10;240(4858): 1544-6; Vasanthakumar and Ahmed, Cancer Conunun. 1989;1(4):225- 
32; Peris et aL, Brain Res Mol Brain Res. 1998 Jun 15;57(2):3 10-20; U. S. Patent 
5,801,154; U.S. Patent 5,789,573; U. S. Patent 5,718,709 and U.S. Patent 5,610,288). 
Antisense constructs have also been described that inhibit and can be used to treat a 

20 variety of abnormal cellular proliferations, e.g. cancer (U. S, Patent 5,747,470; U. S. 
Patent 5,591,317 andU. S. Patent 5,783,683). 

Therefore, in certain embodiments, the present invention provides 
oligonucleotide sequences that comprise all, or a portion of, any sequence that is 
capable of specifically binding to polynucleotide sequence described herein, or a 

25 complement thereof. In one embodiment, the antisense oligonucleotides comprise DNA 
or derivatives thereof In another embodiment, the oligonucleotides comprise RNA or 
derivatives thereof In a third embodiment, the oligonucleotides are modified DNAs 
comprising a phosphorothioated modified backbone. In a fourth embodiment, the 
oligonucleotide sequences comprise peptide nucleic acids or derivatives thereof. In 

30 each case, preferred compositions comprise a sequence region that is complementary. 
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and more preferably substantially-complenientans and even more preferably, 
compleielx' coinpleinentarv' to one or more portions of polynucleotides disclosed herein. 
Selecnon of antisense compositions specific for a given gene sequence is based upon 
anal}'sis of the chosen target sequence and determination of secondar>^ structure, T^, 
> binding energy, and relative stability. Antisense compositions may be selected based 
upon their relative inability to form dimers, hairpins, or other secondary stmctures that 
would reduce or prohibit specific binding to the target mRNA in a host cell. Highly 
preferred target regions of the mRNA, are those which are at or near the AUG 
transhition ir)iiiatii)n codon, and those sequences which are substantially complementary 
10 to 5* rej^ions of the niRNA. These secondar>^ structure analyses and target site selection 
consideratiims can he performed, for example, using v.4 of the OLIGO primer analysis 
soltware and/or liie Bl.ASTN 2.0.5 algorithm software (Altschul et al, Nucleic Acids 
Res. H»7 Sep 1 :25( 1 7):3389-402). 

f lie use of an antisense delivery^ method employing a short peptide 

15 vector, tenned MPG (27 residues), is also contemplated. Tlie MPG peptide contains a 
hydrophobic domain derived from the fusion sequence of HIV gp41 and a hydrophilic 
domain Ironi the nuclear localization sequence of SV40 T-antigen (Morris et al,. 
Nucleic Acids Res. 1997 Jul 1 5:25(1 4):2730-6). It has been demonstrated that several 
molecules of the MPG peptide coat the antisense oligonucleotides and can be delivered 

20 into cultured mammalian cells in less than 1 hour with relatively high efficiency (90%). 
Further, the interaction with MPG strongly increases both the stability of the 
oligonucleotide to nuclease and the ability to cross the plasma membrane. 

According to another embodiment of the invention, the polynucleotide 
compositions described lierein aie used in the design and preparation of ribozyme 

25 molecules for inhibiting expression of the tumor polypeptides and proteins of the 
present invention in tumor cells. Riboz>TOes are RNA-protein complexes that cleave 
nucleic acids in a site-specific fashion. Ribozymes have specific catalytic domains that 
possess endonuclease activity (Kim and Cech, Proc Natl Acad Sci USA. 1987 
Dec;84(24):8788-92; Forster and Symons, Cell. 1987 Apr 24;49(2):2n-20). For 

30 example, a large number of riboz>'mes accelerate phosphoester transfer reactions with a 
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high degree of specificity, often cleaving only one of several phosphoesters in an 
oligonucleotide substrate (Cech et al. Cell. 1981 Dec;27(3 Pt 2):487-96; Michel and 
Westhof, J Mol Biol. 1990 Dec 5;216(3):585-610; Reinhold-Hurek and Shub, Nature. 
1992 May 14;357(6374): 173-6). This specificity has been attributed to the requirement 
5 that the substrate bind via specific base-pairing interactions to the internal guide 
sequence ("IGS") of the ribozyme prior to chemical reaction. 

Six basic varieties of naturally-occurring enzymatic RNAs are known 
presently. Each can catalyze the hydrolysis of RNA phosphodiester bonds in trans (and 
thus can cleave other RNA molecules) under physiological conditions. In general, 

10 enzymatic nucleic acids act by first binding to a target RNA. Such binding occurs 
through the target binding portion of a enzymatic nucleic acid which is held in close 
proximity to an enzymatic portion of the molecule that acts to cleave the target RNA. 
Thus, the enzymatic nucleic acid first recognizes and then binds a target RNA through 
complementary base-pairing, and once bound to the correct site, acts enzymatically to 

15 cut the target RNA. Strategic cleavage of such a target RNA will destroy its ability to 
direct synthesis of an encoded protein. After an enzymatic nucleic acid has bound and 
cleaved its RNA target, it is released from that RNA to search for another target and can 
repeatedly bind and cleave new targets. 

The enzymatic nature of a ribozyme is advantageous over many 

20 technologies, such as antisense technology (where a nucleic acid molecule simply binds 
to a nucleic acid target to block its translation) since the concentration of ribozyme 
necessary to aifect a therapeutic treatment is lower than that of an antisense 
oligonucleotide. This advantage reflects the ability of the ribozyme to act 
enzymatically. Thus, a single ribozyme molecule is able to cleave many molecules of 

25 target RNA. In addition, the ribozyme is a highly specific inhibitor, with the specificity 
of inhibition depending not only on the base pairing mechanism of binding to the target 
RNA, but also on the mechanism of target RNA cleavage. Single mismatches, or base- 
substitutions, near the site of cleavage can completely eliminate catalytic activity of a 
ribozyme. Similar mismatches in antisense molecules do not prevent their action 

30 (Woolf etal, Proc Natl Acad Sci USA, 1992 Aug 15;89(16):7305-9). Thus, the 
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specificity of action of a riboz} me is greater than that of an antisense ohgonucleotide 
binding the same RNA site. 

The enz>'matic nucleic acid molecule may be formed in a hammerhead, 
hairpin, a hepatitis 8 virus, group I intron or RNaseP RNA (in association with an RNA 
5 guide sequence) or Neurospora VS RNA motif Examples of hammerhead motifs are 
described by Rossi er al Nucleic Acids Res. 1992 Sep 1 1 ;20(] 7):4559-65. Examples of 
haiipin motifs are described by Hampel et al (Eur, Pat. AppL Publ. No. EP 0360257), 
Hampel and Tritz, Biochemistr}^ 1989 Jun 13;28(]2):4929-33; Hampel et al, Nucleic 
Acids Res. 1990 Jan 25;1 8(2):299-304 and U. S. Patent 5,631,359. An example of the 
0 hepatitis 5 virus motif is described by Perrotta and Been, Biochemistr}\ 1992 Dec 
1 ;3 1(47): 1 1843-52; an example of the RNaseP motif is described by Guerrier-Takada 
etal. Cell. 1983 Dec;35(3 Pt 2):849-57; Neurospora VS RN^A riboz}^me motif is 
described by Collins (Saville and Collins, Cell. 1990 May 1 8;6] (4):685-96; Saville and 
Collins, Proc Natl Acad Sci USA. 1991 Oct l;88(19):8826-30; Collins and Olive, 
5 Biochemistr>^ 1993 Mar 23;32(1 1):2795-9); and an example of the Group I intron is 
described in (U. S. Patent 4,987,071). All that is important in an enzymatic nucleic acid 
molecule of this invention is that it has a specific substrate binding site which is 
complementary to one or more of the target gene RNA regions, and that it have 
nucleotide sequences within or suiTOunding that substrate binding site wliich impart an 
0 RNA cleaving activity to the molecule. Thus the ribozyme constructs need not be 
limited to specific motifs mentioned herein. 

Ribozymes may be designed as described in Int. Pat. Appl. PubL No. 
WO 93/23569 and Int. Pat. Appl. Publ. No. WO 94/02595, each specifically 
incorporated herein by reference) and synthesized to be tested in vitro and in vivo, as 
described- Such ribozymes can also be optimized for delivery. While specific 
examples are provided, those in the art will recognize that equivalent RNA targets in 
other species can be utilized when necessary. 

Ribozyme activity can be optimized by altering the length of the 
riboz\™e binding arms, or chemically synthesizing ribozy^mes with modifications that 
prevent their degradation by serum ribonucleases (see e.g,. Int. Pat. AppL Publ. No. WO 



wo (M/51633 



PC T/l) SO 1/0 1574 



66 

92/07065; Int. Pat. Appl. PubL No. WO 93/15187; Int. Pat, AppL PubL No. WO 
91/03162; Eur. Pat. Appl. PubL No. 92110298.4; U. S. Patent 5,334,711; and Int. Pat. 
Appl. Publ. No. WO 94/13688, which describe various chemical modifications that can 
be made to the sugar moieties of enzymatic RNA molecules), modifications which 
5 enhance their efficacy in cells, and removal of stem II bases to shorten RNA synthesis 
times and reduce chemical requirements. 

Sullivan ei al (Int. Pat. AppL PubL No. WO 94/02595) describes the 
general n}cthods for delivery of enzymatic RNA molecules. Ribozymes may be 
administered to cells by a variety of methods known to those familiar to the art, 

10 including, but not restricted to, encapsulation in liposomes, by iontophoresis, or by 
incorporation into other vehicles, such as hydrogels, cyclodextrins, biodegradable 
nanocapsules, and bioadhesive microspheres. For some indications, ribozymes may be 
direct!}' delivered ex vivo to cells or tissues with or without the aforementioned vehicles. 
Alternatively, the RNA/vehicle combination may be locally delivered by direct 

15 inhalation, by direct injection or by use of a catheter, infusion pump or stent. Other 
routes of delivery include, but are not limited to, intravascular, intramuscular, 
subcutaneous or joint injection, aerosol inhalation, oral (tablet or pill form), topical, 
systemic, ocular, intraperitoneal and/or intrathecal delivery. More detailed descriptions 
of ribozyme delivery and administration are provided in Int. Pat. Appl. Publ. No. WO 

20 94/02595 and Int. Pat. AppL PubL No. WO 93/23569, each specifically incorporated 
herein by reference. 

Another means of accumulating high concentrations of a ribozyme(s) 
within cells is to incorporate the ribozyme-encoding sequences into a DNA expression 
vector. Transcription of the ribozyme sequences are driven from a promoter for 

25 eukaryotic RNA polymerase I (pol I), RNA polymerase II (pol II), or RNA polymerase 
III (pol III). Transcripts from pol II or pol III promoters will be expressed at high levels 
in all cells; the levels of a given pol II promoter in a given cell type will depend on the 
nature of the gene regulatory sequences (enhancers, silencers, etc.) present nearby. 
Prokaryotic RNA polymerase promoters may also be used, providing that the 

30 prokaryotic RNA polymerase enzyme is expressed in the appropriate cells Ribozymes 
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expressed from such promoters ha\e been shown to function in maniniahan cells. Such 
transcription units can be incorporated into a variet\^ of vectors for introduction into 
njanin:ialian cells, including but not restricted to, plasmid DNA vectors, viral UNA 
\ectors (such as adenovirus or adeno-associated vectors), or viral RNA vectors {such as 

3 retroviral, semliki forest virus, sindbis virus vectors). 

In another embodiment of the invention, peptide nucleic acids (PNAs) 
compositions are provided. PNA is a DNA mimic in which the nucleobases are 
attached to a pseudopeptide backbone (Good and Nielsen, Antisense Nucleic Acid Drug 
Dev. 1997 7(4) 431-37). PNA is able to be utilized in a number methods that 

0 traditionally have used RNA or DNA. Often PNA sequences perform better in 
techniques than the corresponding RNA or DNA sequences and have utilities that are 
not inlierent to RNA or DNA. A review of PNA including methods of making, 
characteristics of and methods of using, is provided by Corey (Trends Biotechnol 1 997 
Jun;l 5(6):224-9). As such, in certain embodiments, one may prepare PNA sequences 

5 that are complementary to one or more portions of the ACE mRNA sequence, and such 
PNA compositions may be used to regulate, alter, decrease, or reduce the translation of 
ACE-specific mRNA, and thereby alter the level of ACE activity in a host cell to which 
such PNA compositions have been administered. 

PNAs have 2-ajBinoethyl-glycine linkages replacing the normal 

0 phosphodiester backbone of DNA (Nielsen et al. Science 1991 Dec 6;254(5037):1497- 
500; Hanvey et al. Science. 1992 Nov 27;258(5087):1481-5; Hyrup and Nielsen, 
Bioorg Med Chem. 1996 Jan;4(l):5-23). This chemistry has three important 
consequences: firstly, in contrast to DNA or phosphorothioate oligonucleotides, PNAs 
are neutral molecules; secondly, PNAs are achiral, which avoids the need to develop a 

3 stereoselective synthesis; and thirdly, PNA synthesis uses standard Boc or Fmoc 
protocols for solid-phase peptide synthesis, although other methods, including a 
modified Merrifield method, have been used. 

PNA monomers or ready-made oligomers are commercially available 
from PerSeptive Biosystems (Framingham, NIA). PNA syntheses by either Boc or 

0 Fmoc protocols are straightforward using manual or automated protocols (Norton et aL, 
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Bioorg Med Chem. 1995 Apr;3(4):437-45). The manual protocol lends itself to the 
production of chemically modified PNAs or the simultaneous synthesis of families of 
closely related PNAs. 

As with peptide synthesis, the success of a particular PNA synthesis will 
5 depend on the properties of the chosen sequence. For example, while in theory PNAs 
can incorporate any combination of nucleotide bases, the presence of adjacent purines 
can lead to deletions of one or more residues in the product. In expectation of this 
difficulty, it is suggested that, in producing PNAs with adjacent purines, one should 
repeat the coupling of residues likely to be added inefficiently. This should be followed 
10 by the purification of PNAs by reverse-phase high-pressui-e liquid chromatography, 
providing yields and pxuity of product similar to those observed during the synthesis of 
peptides. 

Modifications of PNAs for a given application may be accomplished by 
coupling amino acids during solid-phase synthesis or by attaching compounds that 

15 contain a carboxylic acid group to the exposed N-terminal amine. Alternatively, PNAs 
can be modified after synthesis by coupling to an introduced lysine or cysteine. The 
ease witli which PNAs can be modified facilitates optimization for better solubility or 
for specific functional requirements. Once synthesized, the identity of PNAs and their 
derivatives can be confirmed by mass spectrometry. Several studies have made and 

20 utilized modifications of PNAs (for example, Norton et al^ Bioorg Med Chem. 1995 
Apr;3(4):437-45; Petersen et al, J Pept Sci. 1995 May-Jun;l (3): 175-83; Orum et al, 
Biotechniques. 1995 Sep;19(3):472-80; Footer et aL, Biochemistry. 1996 Aug 
20;35(33): 10673-9; Griffith et ai. Nucleic Acids Res. 1995 Aug 1 1;23(15):3003-8; 
Pardridge et al, Proc Natl Acad Sci USA. 1995 Jun 6;92(12):5592-6; Boffa et al, 

25 Proc Natl Acad Sci USA. 1995 Mar 14;92(6):1901-5; Gambacorti-Passerini et aL, 
Blood. 1996 Aug 15;88(4):141 1-7; Armitage et al, Proc Natl Acad Sci USA. 1997 
Nov 11;94(23): 12320-5; Seeger et al, Biotechniques. 1997 Sep;23(3):512-7). U.S. 
Patent No. 5,700,922 discusses PNA-DNA-PNA chimeric molecules and their uses in 
diagnostics, modulating protein in organisms, and treatment of conditions susceptible to 

30 therapeutics- 
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iMethods of characterizing the antisense binding properties of PNAs are 
discussed in Rose C^al Chem. 1993 Dec 1 5;65(24):3545-9) and Jensen et al 
(Biochemistr>'. 1997 Apr 22:36(1 6):5072-7). Rose uses capillars^ gel electrophoresis to 
determine binding of PNAs to their comp]ementar>^ oligonucleotide, measuring the 
5 relative binding kinetics and stoicliiometr}'. Similar types of measurements were made 
by Jensen et al using BIAcore™ technology. 

Other applications of PNAs that have been described and will be 
apparent to the skilled artisan include use in DNA strand invasion, antisense inhibition, 
mutational analysis, enhancers of transcription, nucleic acid purification, isolation of 
10 transcriptionally active genes, blocking of transcription factor binding, genome 
cleavage, biosensors, w situ hybridization, and the like. 

Polynucleotide Identification. Characterization and Expression 

Polynucleotide compositions of the present invention may be identified, 
prepared and/or manipulated using any of a variety of well established techniques (see 

15 generally, Sambrook et aL, Molecular Cloning: A Laboratory Manual Cold Spring 
Harbor Laboratories, Cold Spring Harbor, NY, 1989, and other like references). For 
example, a polynucleotide may be identified, as described in more detail below, by 
screening a microarray of cDNAs for tumor-associated expression {i.e., expression that 
is at least two fold greater in a tumor than in normal tissue, as determined using a 

20 representative assay provided herein). Such screens may be performed, for example, 
using the microarray teclmology of Affymetrix, Inc. (Santa Clara, CA) according to the 
manufacturer's instructions (and essentially as described by Schena et al., Proc. Natl 
Acad Sci. USA Pi:10614-]06]9, 1996 and Heller et al., Froc. Natl Acad ScL USA 
9-^:2150-2155, 1997). Alternatively, polynucleotides may be amplified from cDNA 

25 prepared from cells expressing the proteins described herein, such as tumor cells. 

Many template dependent processes are available to amplify a target 
sequences of interest present in a sample. One of the best known amplification methods 
is the polymerase chain reaction (PCR'^^0 which is described in detail in U.S. Patent 
Nos. 4,683,195, 4,683,202 and 4,800,159, each of which is incorporated herein by 
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reference in its entirety. Briefly, in PGR™, two primer sequences aie prepared which 
are complementary to regions on opposite complementary strands of the target 
sequence. An excess of deoxynucleoside triphosphates is added to a reaction mixture 
along with a DNA polymerase (e.g.. Tag polymerase). If the target sequence is present 
5 in a sample, the primers will bind to the target and the polymerase will cause the 
primers to be extended along the target sequence by adding on nucleotides. By raising 
and lowering the temperature of the reaction mixture, the extended primers will 
dissociate from the target to form reaction products, excess primers will bind to the 
target and to the reaction product and the process is repeated. Preferably reverse 
10 transcription and PGR™ amplification procedure may be performed in order to quantify 
the amount of mRNA amplified. Polymerase chain reaction methodologies are well 
known in the art. 

Any of a number of other template dependent processes, many of which 
are variations of the PGR ™ amplification technique, are readily known and available in 

] 5 the art. Illustratively, some such methods include the ligase chain reaction (referred to 
as LCR), described, for example, in Eur. Pat. Appl. Publ. No. 320,308 and U.S. Patent 
No. 4,883,750; Qbeta Replicase, described in PCT Intl. Pat. AppL Publ. No. 
PCT/US87/00880; Strand Displacement Amplification (SDA) and Repair Chain 
Reaction (RCR). Still other amplification methods are described in Great Britain Pat. 

20 AppL No. 2 202 328, and in PGT Intl. Pat. Appl. Publ. No. PCT/US89/01025. Other 
nucleic acid amplification procedures include transcription-based amplification systems 
(TAS) (PCT Intl. Pat. Appl. Publ, No. WO 88/10315), including nucleic acid sequence 
based amplification (NASBA) and 3 SR. Eur. Pat. Appl. Publ. No. 329,822 describes a 
nucleic acid amplification process involving cyclically synthesizing single-stranded 

25 RNA ("ssRNA"), ssDNA, and double-stranded DNA (dsDNA). PCT Inti. Pat. Appl. 
Publ. No. WO 89/06700 describes a nucleic acid sequence amplification scheme based 
on the hybridization of a promoter/primer sequence to a target single-stranded DNA 
("ssDNA") followed by transcription of many RNA copies of the sequence. Other 
amplification methods such as "RACE" (Frohman, 1990), and "one-sided PGR" (Ohara, 

30 1 989) are also well-known to those of skill in the art. 
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An amplified portion of a polynucleotide of the present invention may be 
used to isolate a full length gene from a suitable library^ (^.g., a tumor cDNA librar}^) 
using well known tecliniques. Within such techniques, a librar>' (cDNA or genomic) is 
screened using one or more polynucleotide probes or primers suitable for amplification. 
5 Preferably, a library is size-selected to include larger molecules. Random primed 
libraries may also be preferred for identifying 5' and upstream regions of genes. 
Genomic libraries are preferred for obtaining inlrons and extending 5' sequences. 

For hybridization techniques, a partial sequence may be labeled (e,g,, by 
nick-translation or end-labeling with ^^P) using well known techniques. A bacterial or 

10 bacteriophage library is then generally screened by hybridizing filters containing 
denatured bacterial colonies (or lawns containing phage plaques) with the labeled probe 
(see Sambrook et al.. Molecular Clonuig: A Lahoratory Mcmval Cold Spring Harbor 
Laboratories, Cold Spring Harbor, NY, 1989). Hybridizing colonies or plaques are 
selected and expanded, and the DNA is isolated for further analysis. cDNA clones may 

1 5 be analyzed to determine the amount of additional sequence by, for example, PCR using 
a primer fi-om the partial sequence and a primer from the vector. Restriction maps and 
partial sequences may be generated to identify one or more overlapping clones. 7he 
complete sequence may then be determined using standard techniques, which may 
involve generating a series of deletion clones. The resulting overlapping sequences can 

20 then assembled into a single contiguous sequence. A full length cDNA molecule can be 
generated by ligating suitable fragments, using well known techniques. 

Alternatively, amplification techniques, such as those described above, 
can be useful for obtaining a full length coding sequence from a partial cDNA sequence. 
One such amplification technique is inverse PCR {see Triglia et aL, Nucl Acids Res. 

25 76:8186, 1988), which uses restricfion enzymes to generate a fragment in the knowTi 
region of the gene. The fragment is then circularized by intramolecular ligation and 
used as a template for PCR with divergent primers derived from the known region. 
Within an alternative approach, sequences adjacent to a partial sequence may be 
retrieved by amplification with a primer to a linker sequence and a primer specific to a 

>0 known region. The amplified sequences are typically subjected to a second round of 
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amplification with the same Hnker primer and a second primer specific to the known 
region. A variation on this procedure, which employs two primers that initiate 
extension in opposite directions from the known sequence, is described in WO 
96/38591. Another such technique is known as "rapid amplitication of cDNA ends" or 
5 RACE. This technique involves the use of an internal primer and an external primer, 
which hybridizes to a polyA region or vector sequence, to identify sequences that are 5' 
and 3' of a known sequence. Additional techniques include capture PGR (Lagerstrom et 
al., PCR Methods Applic, 7:111-19, 1991) and walking PGR (Parker et al, Niicl Acids. 
Res. 79:3055-60, 1991). Other methods employing amplification may also be employed 

10 to obtain a full length cDNA sequence. 

In certain instances, it is possible to obtain a full length cDNA sequence 
by analysis of sequences provided in an expressed sequence tag (EST) database, such as 
that available from GenBank. Searches for overlapping ESTs may generally be 
performed using well known programs {e.g., NGBI BLAST searches), and such ESTs 

15 may be used to generate a contiguous full length sequence. Full length DNA sequences 
may also be obtained by analysis of genomic fragments. 

In other embodiments of the invention, polynucleotide sequences or 
fragments thereof which encode polypeptides of the invention, or fusion proteins or 
functional equivalents thereof, may be used in recombinant DNA molecules to direct 

20 expression of a polypeptide in appropriate host cells. Due to the inherent degeneracy of 
the genetic code, other DNA sequences that encode substantially the same or a 
functionally equivalent amino acid sequence may be produced and these sequences may 
be used to clone and express a given polypeptide. 

As will be xmderstood by those of skill in the art, it may be advantageous 

25 in some instances to produce polypeptide-encoding nucleotide sequences possessing 
non-naturally occurring codons. For example, codons preferred by a particular 
prokaryotic or eukaryotic host can be selected to increase the rate of protein expression 
or to produce a recombinant RNA transcript having desirable properties, such as a half- 
life which is longer than that of a transcript generated from the naturally occurring 

30 sequence. 
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Moreover, the polynucleotide sequences of the present invention can be 
engineered using methods generally known in the art in order to alter polypeptide 
encoding sequences for a \ ariety of reasons, including but not limited to, alterations 
which modify the cloning, processing, and/or expression of the gene product. For 
5 example, DNA shuffling by random fragmentation and PCR reassembly of gene 
fragments and synthetic oligonucleotides may be used to engineer the nucleotide 
sequences. In addition, site-directed mutagenesis may be used to insert new restriction 
sites, alter glycosylation patterns, change codon preference, produce splice variants, or 
introduce mutations, and so forth. 

In another embodiment of the invention, natural, modified, or 
recombinant nucleic acid sequences may be ligated to a heterologous sequence to 
encode a fusion protein. For example, to screen peptide libraries for inhibitors of 
polypeptide activit>^ it may be useful to encode a chimeric protein that can be 
recognized by a commercially available antibody. A fusion protein may also be 

15 engineered to contain a cleavage site located between the polypeptide-encoding 
sequence and the heterologous protein sequence, so that the polypeptide may be cleaved 
and purified away from the heterologous moiety. 

Sequences encoding a desired polypeptide may be synthesized, in whole 
or in part, using chemical methods well known in the art (see Caruthers, M. H. et al. 

20 (1980) Nucl. Acids Res. Symp. Ser. 215-223, Horn, T. et al. (1980) NucL Acids Res. 
Symp. Ser. 225-232). Alternatively, the protein itself may be produced using chemical 
methods to synthesize the amino acid sequence of a polypeptide, or a portion thereof. 
For example, peptide synthesis can be performed using various solid-phase tecloniques 
(Roberge, J. Y. et al. (1995) Science 269:202-204) and automated synthesis may be 

25 achieved, for example, using the ABI 431 A Peptide Synthesizer (Perkin Elmer, Palo 
Alto, CA). 

A newly synthesized peptide may be substantially purified by preparative 
high performance liquid chromatography (e.g., Creighton, T. (1983) Proteins, Structures 
and Molecular Principles, Freeman and Co., New York, N.Y.) or other comparable 
30 techniques available in the art. The composition of the synthetic peptides may be 
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confirmed by amino acid analysis or sequencing (e.g., the Edman degradation 
procedure). Additionally, the amino acid sequence of a polypeptide, or any part thereof, 
may be altered during direct synthesis and/or combined using chemical methods with 
sequences from other proteins, or any part thereof, to produce a variant polypeptide. 
5 In order to express a desired polypeptide, the nucleotide sequences 

encoding the polypeptide, or functional equivalents, may be inserted into appropriate 
expression vector, i.e., a vector which contains the necessary elements for the 
transcription and translation of the inserted coding sequence. Methods which are well 
known to those skilled in the art may be used to constmct expression vectors containing 

10 sequences encoding a polypeptide of interest and appropriate ti*anscriptional and 
translational control elements. These methods include in vitro recombinant DNA 
techniques, synthetic techniques, and m vivo genetic recombination. Such techniques 
are described, for example, in Sambrook, J. et al. (1989) Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Press, Plainview, N.Y., and Ausubel, F. M. et 

15 al. (1989) Current Protocols in Molecular Biology, John Wiley & Sons, New York. 
N.Y. 

A variety of expression vector/host systems may be utilized to contain 
and express polynucleotide sequences. These include, but are not limited to, 
microorganisms such as bacteria transformed with recombinant bacteriophage, plasmid, 

20 or cosmid DNA expression vectors; yeast transformed with yeast expression vectors; 
insect cell systerns infected with virus expression vectors (e.g.^ baculovirus); plant cell 
systems transformed with virus expression vectors (e.g., cauliflower mosaic virus, 
CaMV; tobacco mosaic virus, TMV) or with bacterial expression vectors {e.g.^ Ti or 
pBR322 plasmids); or animal cell systems. 

25 The "control elements" or "regulatory sequences" present in an 

expression vector are those non-translated regions of the vector— enhancers, promoters, 
5' and 3' untranslated regions— which interact with host cellular proteins to carry out 
transcription and translation. Such elements may vary in their strength and specificity. 
Depending on the vector system and host utilized, any number of suitable transcription 

30 and translation elements, including constitutive and inducible promoters, may be used. 
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For example, when cloning in bacterial systems, inducible promoters such as the hybrid 
lacZ promoter of the PBLUESCRJPT phagemid (Stratagene, La Jolla, Calif.) or 
PSPORT] plasmid (Gibco BRL, Gaithersburg, MD) and the like may be used. In 
mammalian cell systems, promoters from mammalian genes or from mammalian viruses 
^ are generally prefenred. If it is necessary to generate a cell line that contains multiple 
copies of the sequence encoding a polypeptide, vectors based on SV40 or EBV may be 
advantagcousK' used with an appropriate selectable marker. 

In bacterial systems, any of a number of expression vectors may be 
selected dc|xrndmg upon the use intended for the expressed polypeptide. For example, 

10 when large quantities are needed, for example for the induction of antibodies, vectors 
which direct high lc\ el expression of fusion proteins that are readily purified may be 
used. Such \cclors include, but are not limited to. the muhifunctional E. coli cloning 
and expression vectors such as BLUESCRJPT (Stratagene), in which the sequence 
encoding the pi>l>peptide of interest may be ligated into the vector in frame with 

15 sequences for the amino-terminal Met and the subsequent 7 residues of .beta.- 
galactosidase so that a hybrid protein is produced; pIN vectors (Van Heeke, G. and S. 
M. Schuster (1989) J. BioL Chem. 264:5503-5509); and the like. pGEX Vectors 
(Promega, Madison, Wis.) may also be used to express foreign polypeptides as fusion 
proteins with glutathione S-transferase (GST). In general, such fusion proteins are 

20 soluble and can easily be purified from lysed cells by adsorption to glutathione-agarose 
beads followed by elution in the presence of free glutathione. Proteins made in such 
systems may be designed to include heparin, thrombin, or factor XA protease cleavage 
sites so that the cloned polypeptide of interest can be released from the GST moiety at 
will. 

In the yeast, Saccharomyces cerevisiae, a number of vectors containing 
constitutive or inducible promoters such as alpha factor, alcohol oxidase, and PGH may 
be used. For reviews, see Ausubel et al. (supra) and Grant et al. (1987) Methods 
EnzymoL 153:516-544. 

In cases where plant expression vectors are used, the expression of 
M> sequences encoding polypeptides may be driven by any of a number of promoters. For 
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example, viral promoters such as the 35S and 19S promoters of CaMV may be used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. 
(1987) EMBO J. 6:307-31 1. Alternatively, plant promoters such as the small subunit of 
RUBISCO or heat shock promoters may be used (Coruzzi, Q. et al. (1984) EMBO J. 
5 5:1671-1680; Broglie, R. et al. (1984)5^/^;?^^ 22^:838-843; and Winter, J. et al. (1991) 
Results Probl Cell Differ. 77:85-105). These constructs can be introduced into plant 
cells by direct DNA transformation or pathogen-mediated transfection. Such techniques 
are described in a number of generally available reviews (see, for example, Hobbs, S. or 
Murry, L. E. in McGraw Hill Yearbook of Science and Technology (1992) McGraw 

10 Hill, New York, N.Y.; pp. 191-196). 

An insect system may also be used to express a polypeptide of interest. 
For example, in one such system, Autographa califomica nuclear polyhedrosis virus 
(AcNPV) is used as a vector to express foreign genes in Spodoptera frugiperda cells or 
in Trichoplusia larvae. The sequences encoding the polypeptide may be cloned into a 

15 non-essential region of the virus, such as the polyhedrin gene, and placed under control 
of the polyhedrin promoter. Successful insertion of the polypeptide-encoding sequence 
will render the polyhedrin gene inactive and produce recombinant virus lacking coat 
protein. The recombinant viruses may then be used to infect, for example, S. frugiperda 
cells or Trichoplusia larvae in which the polypeptide of interest may be expressed 

20 (Engelhard, E. K. et al. (1994) Proc, Natl Acad. ScL 91 :3224-3227). 

In mammalian host cells, a number of viral-based expression systems are 
generally available. For example, in cases where an adenovirus is used as an expression 
vector, sequences encoding a polypeptide of interest may be ligated into an adenovirus 
transcription/translation complex consisting of the late promoter and tripartite leader 

25 sequence. Insertion in a non-essential Bl or E3 region of the viral genome may be used 
to obtain a viable virus which is capable of expressing the polypeptide in infected host 
cells (Logan, J. and Shenk, T. (1984) Proc. Natl Acad. ScL 57:3655-3659). In addition, 
transcription enliancers, such as the Rous sarcoma virus (RSV) enhancer, may be used 
to increase expression in mammalian host cells. 
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Specific initiation signals may also be used to achieve more efficient 
translation of sequences encoding a polypeptide of interest. Such signals include the 
ATG initiation codon and adjacent sequences. In cases where sequences encoding the 
polypeptide, its initiation codon, and upstream sequences are inserted into the 
3 appropriate expression vector, no additional transcriptional or translational control 
signals may be needed. However, in cases where only coding sequence, or a portion 
thereof, is inserted, exogenous translational control signals including the ATG initiation 
codon should be provided. Furthermore, the initiation codon should be in the correct 
reading frame to ensure translation of the entire insert. Exogenous translational 

10 elements and initiation codons may be of various origins, both natural and synthetic. 
The efficiency of expression may be enhanced by the inclusion of enhancers which are 
appropriate for the particular cell system which is used, such as those described in the 
literature (Scharf, D. et al. (1994) Results Probl Cell Differ. 20:125-162). 

In addition, a host cell strain may be chosen for its ability to modulate 

1 5 the expression of the inserted sequences or to process the expressed protein in the 
desired fashion. Such modifications of the polypeptide include, but are not limited to, 
acetylation, carboxylation. glycosylation, phosphorylation, lipidation, and acylation. 
Post-translational processing which cleaves a "prepro" form of the protein may also be 
used to facilitate correct insertion, folding and/or function. Different host cells such as 

20 CHO, COS, HeLa, MDCK, HEK293, and WI38, which have specific cellular machinery 
and characteristic mechanisms for such post-translational activities, may be chosen to 
ensure the correct modification and processing of the foreign protein. 

For long-term, high-yield production of recombinant proteins, stable 
expression is generally prefeired. For example, cell lines which stably express a 

25 polynucleotide of interest may be transformed using expression vectors which may 
contain viral origins of replication and/or endogenous expression elements and a 
selectable marker gene on the same or on a separate vector. Following the introduction 
of the vector, cells may be allow^ed to grow for 1-2 days in an enriched media before 
they are switched to selective media. The purpose of the selectable marker is to confer 

30 resistance to selection, and its presence allows growth and recovery of cells wliich 
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successfttlly express the introduced sequences. Resistant clones of stably transformed 
cells may be proliferated using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed 
cell lines. These include, but are not limited to, the herpes simplex virus thymidine 
5 kinase (Wigler, M. et al. (1977) Cell 77:223-32) and adenine phosphoribosyltransferase 
(Lowy , 1. et al. (1990) Cell 22:817-23) genes which can be employed in tk.sup.- or 
aprt.sup.- cells, respectively. Also, antimetabolite, antibiotic or herbicide resistance can 
be used as the basis for selection; for example, dhfr which confers resistance to 
melholrexate (Wigler, M. et al. (1980) Proc. Natl Acad. ScL 77:3567-70); npt, which 

10 confers resistance to the aminoglycosides, neomycin and G-418 (Colbere-Garapin, F. et 
al J Mol. Biol 750:1-14); and als or pat, which confer resistance to 

chlorsulfuron and phosphinotricin acetyltransferase, respectively (Mitrry, supra). 
Additional selectable genes have been described, for example, trpB, which allows cells 
to utilize indole in place of tryptophan, or hisD, which allows cells to utilize histinol in 

15 place of hislidine (Hartman, S. C. and R. C. Mulligan (1988) Proc. Natl Acad, Sci. 
^5:8047-51). The use of visible markers has gained popularity with such markers as 
anthocyanins, beta-glucuronidase and its substrate GUS, and luciferase and its substrate 
luciferin, being widely used not only to identify transformants, but also to quantify the 
amount of transient or stable protein expression attributable to a specific vector system 

20 (Rhodes, C. A. et al. (1995) Methods Mol Biol 55:121-131). 

Although the presence/absence of marker gene expression suggests that 
the gene of interest is also present, its presence and expression may need to be 
confirmed. For example, if the sequence encoding a polypeptide is inserted within a 
marker gene sequence, recombinant cells containing sequences can be identified by the 

25 absence of marker gene function. Alternatively, a marker gene can be placed in tandem 
with a polypeptide-encoding sequence under the control of a single promoter. 
Expression of the marker gene in response to induction or selection usually indicates 
expression of the tandem gene as well. 

Alternatively, host cells that contain and express a desired 

30 polynucleotide sequence may be identified by a variety of procedures known to those of 
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skill in the art. These procedures include, but aie not limited to, DNA-DNA or DNA- 
KNA hybridizations and protein bioassay or immunoassay techniques which include, 
tor example, membrane, solution, or chip based technologies for the detection and/or 
quantification of nucleic acid or protein. 

A variety of protocols for detecting and measuring the expression of 
polynucleotide-encoded products, using either polyclonal or monoclonal antibodies 
specific for the product are known in the art. Examples include enzyme-linked 
nrtmunosorbent assay (ELISA), radioimmunoassay (RJA), and fluorescence activated 
cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal 
antibodies reactive to two non-interfering epitopes on a given polypeptide may be 
preferred for some applications, but a competitive binding assay may also be employed. 
These and other assays are described, among other places, in Hampton, R. et al. (1990; 
Serological Methods, a Laboratoiy Manual, APS Press, St Paul. Minn.) and Maddox, D. 
H. et al. (1983; J. Exp. Meet 75(b': 121 1-1216). 

A wide variety^ of labels and conjugation techniques are known by those 
skilled in the art and may be used in various nucleic acid and amino acid assays. Means 
for producing labeled hybridization or PGR probes for detecting sequences related to 
polynucleotides include oligolabeling, nick translation, end-labeling or PGR 
amplification using a labeled nucleotide. Alternatively, the sequences, or any portions 
thereof may be cloned into a vector for the production of an mRNA probe. Such vectors 
are known in the art, are commercially available, and may be used to synthesize RNA 
probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6 
and labeled nucleotides. These procedures may be conducted using a variety of 
commercially available kits. Suitable reporter molecules or labels, which may be used 
include radionuclides, enz>mies, fluorescent, chemiluminescent, or chromogenic agents 
as well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with a polynucleotide sequence of interest may be 
cultured under conditions suitable for the expression and recovery of the protein from 
cell culture. The protein produced by a recombinant cell may be secreted or contained 
intracellularly depending on the sequence and/or the vector used. As will be understood 
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by those of skill in the art, expression vectors containing polynucleotides of the 
invention may be designed to contain signal sequences which direct secretion of the 
encoded polypeptide through a prokaryotic or eukaryotic cell membrane. Other 
recombinant constructions may be used to join sequences encoding a polypeptide of 

5 interest to nucleotide sequence encoding a polypeptide domain which will facilitate 

i 

purification of soluble proteins. Such purification facilitating domains include, but are 
not limited to, metal chelating peptides such as histidine-tryptophan modules that allow 
purification on immobilized metals, protein A domains that allow purification on 
immobilized immunoglobulin, and the domain utilized in the FLAGS extension/affinity 

10 purification system (Immunex Corp., Seattle, Wash.). The inclusion of cleavable linker 
sequences such as those specific for Factor XA or enterokinase (Invitrogen. San Diego, 
Calif) between the purification domain and the encoded polypeptide may be used to 
facilitate purification. One such expression vector provides for expression of a fusion 
protein containing a polypeptide of interest and a nucleic acid encoding 6 histidine 

1 5 residues preceding a thioredoxin or an enterokinase cleavage site. The histidine residues 
facilitate purification on IMIAC (immobilized metal ion affinity chromatography) as 
described in Porath, J. et al. (1992, Prot, Exp. Purif 5:263-281) while the enterokinase 
cleavage site provides a means for purifying the desired polypeptide from the fusion 
protein. A discussion of vectors which contain fusion proteins is provided in Kroll, D. J. 

20 et al. (1993; DNA Cell Biol 72:441-453). 

In addition to recombinant production methods, polypeptides of the 
invention, and fragments thereof, may be produced by direct peptide synthesis using 
solid-phase techniques (Merrifield J. (1963) J. Am, Chem. Soc. 55:2149-2154). Protein 
synthesis may be performed using manual techniques or by automation. Automated 

25 synthesis may be achieved, for example, using Applied Biosy stems 431 A Peptide 
Synthesizer (Perkin Elmer). Alternatively, various fragments may be chemically 
synthesized separately and combined using chemical methods to produce the full length 
molecule. 
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. \jitibodv Compositions. Fragments Thereof and Other Binding Agents 

According to another aspect, the present invention further provides 
binding agents, such as antibodies and antigen-binding fragments thereof, that exhibit 
immunologica] binding to a tumor polypeptide disclosed herein, or to a portion, variant 
5 or derivative thereof. .An antibody, or antigen-binding fragment thereof, is said to 
"specifically bind," ''immunogically bind," and/or is 'immunologically reactive'' to a 
polypeptide of the invention if it reacts at a detectable level (within, for example, an 
KLISA assay) with the polypeptide, and does not react detectably with unrelated 
polypeptides under similar conditions. 

Immunological binding, as used in this context, generally refers to the 
non-covalent interactions of the tv^pe which occur between an immunoglobulin 
molecule and an antigen for which the immunoglobulin is specific. The strength, or 
affinity of immunological binding interactions can be expressed in terms of the 
dissociation constant (Kj) of the interaction, wherein a smaller Kd represents a greater 
15 affinity. Immunological binding properties of selected polypeptides can be quantified 
using methods well known in the art. One such method entails measuring the rates of 
antigen-binding site/antigen complex formation and dissociation, wherein those rates 
depend on the concentrations of the complex partners, the affinity of the interaction, and 
on geometric parameters that equally influence the rate in both direcfions. Thus, both 
20 the "on rate constant" (Kon) and the "off rate constant" (Koff) can be determined by 
calculation of the concentrations and the actual rates of association and dissociation. 
The ratio of Koff /Kon enables cancellation of all parameters not related to affinity, and is 
thus equal to the dissociation constant Kd. See, generally, Davies et al. (1990) Annual 
Rev. Biochem. 59:439-473. 

An "antigen-binding site," or "binding portion" of an antibody refers to 
the part of the immunoglobulin molecule that participates in antigen binding. The 
antigen binding site is formed by amino acid residues of the N-terminal variable ("V") 
regions of the heav^ ("H") and light ("L") chains. Three highly divergent stretches 
within the V regions of the heavy and light chains are referred to as "hyper\'ariable 
regions" which are interposed between more conserv^ed flanking stretches knowTi as 
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"framework regions," or "FRs". Thus the term "FR" refers to amino acid sequences 
which are naturally found between and adjacent to hypervariable regions in 
immunoglobulins. In an antibody molecule, the three hypervariable regions of a light 
chain and the three hypervariable regions of a heavy chain are disposed relative to each 
5 other in three dimensional space to form an antigen-binding surface. The antigen- 
binding surface is complementary to the three-dimensional surface of a bound antigen, 
and the three hypervariable regions of each of the heavy and light chains are referred to 
as "complementarity-determining regions," or "CDRs." 

Binding agents may be further capable of differentiating between patients 

10 with and without a cancer, such as prostate cancer, using the representative assays 
provided herein. For example, antibodies or other binding agents that bind to a tumor 
protein will preferably generate a signal indicating the presence of a cancer in at least 
about 20% of patients with the disease, more preferably at least about 30% of patients. 
Alternatively, or in addition, the antibody will generate a negative signal indicating the 

15 absence of the disease in at least about 90% of individuals without the cancer. To 
determine whether a binding agent satisfies this requirement, biological samples (e.g., 
blood, sera, sputum, urine and/or tumor biopsies) from patients with and without a 
cancer (as determined using standard clinical tests) may be assayed as described herein 
for the presence of polypeptides that bind to the binding agent. Preferably, a statistically 

20 significant number of samples with and without the disease will be assayed. Each 
binding agent should satisfy the above criteria; however, those of ordinary skill in the 
art will recognize that binding agents may be used in combination to improve 
sensitivity. 

Any agent that satisfies the above requirements may be a binding agent. 

25 For example, a binding agent may be a ribosome, with or without a peptide component, 
an RNA molecule or a polypeptide. In a preferred embodiment, a binding agent is an 
antibody or an antigen-binding fragment thereof Antibodies may be prepared by any of 
a variety of techniques known to those of ordinary skill in the art. See, e.g., Harlow and 
Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In 

30 general, antibodies can be produced by cell culture techniques, including the generation 
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ol monoclonal antibodies as described herein, or via transfection of antibody genes into 
suitable bacterial or mammalian cell hosts, in order to allow for the production of 
recombinant antibodies. In one technique, an immunogen comprising the polypeptide is 
initially injected into any of a wide variet>' of manimals (e.g.. mice, rats, rabbits, sheep 
5 or goats). In this step, the polypeptides of this invention may serve as the immunogen 
without modification. Alternatively, particularly for relatively short polypeptides, a 
superior immune response may be elicited if the polypeptide is joined to a carrier 
protein, such as bovine serum albumin or keyhole limpet hemocyanin. The immunogen 
is injected into the animal host, preferably according to a predetermined schedule 
0 incorporating one or more booster immimizations, and the animals are bled periodically. 
Polyclonal antibodies specific for the polypeptide may then be purified from such 
mtisera by. for example, affinity cliromatography using the polypeptide coupled to a 
suitable solid support. 

Monoclonal antibodies specific for an antigenic polypeptide of interest 
5 may be prepared, for example, using the technique of Kohler and Milstein, Eur J. 
Immunol 6:511-519, 1976, and improvements thereto. Briefly, these methods involve 
the preparation of immortal cell lines capable of producing antibodies having the 
desired specificity {i.e., reactivity with the polypeptide of interest). Such cell lines may 
be produced, for example, from spleen cells obtained from an animal immunized as 
0 described above. The spleen cells are then immortalized by, for example, fusion with a 
myeloma cell fusion partner, preferably one that is syngeneic with the immunized 
animal. A variety of fusion techniques may be employed. For example, the spleen cells 
and myeloma cells may be combined with a nonionic detergent for a few minutes and 
then plated at low density on a selective medium that supports the growth of hybrid 
5 cells, but not myeloma cells. A preferred selection technique uses HAT (hypoxanthine, 
aminopterin, thymidine) selection. After a sufficient time, usually about ] to 2 weeks, 
colonies of hybrids are obser\'ed. Single colonies are selected and their culture 
supcmatants tested for binding activity against the polypeptide. Hybridomas having 
high reactivity and specificit}^ are preferred. 
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Monoclonal antibodies may be isolated from the supematants of growing 
hybridoma colonies. In addition, various techniques may be employed to enhance the 
yield, such as injection of the hybridoma cell line into the peritoneal cavity of a suitable 
vertebrate host, such as a mouse. Monoclonal antibodies may then be harvested from 
5 the ascites fluid or the blood. Contaminants may be removed from the antibodies by 
conventional techniques, such as chromatography, gel filtration, precipitation, and 
extraction. The polypeptides of this invention may be used in the purification process 
in, for example, an affinity chromatography step. 

A number of therapeutically useful molecules are known in the art which 

10 comprise antigen-binding sites that are capable of exhibiting immunological binding 
properties of an antibody molecule. The proteolytic enzyme papain preferentially 
cleaves IgG molecules to yield several fragments, two of which (the "F(ab)'* fragments) 
each comprise a covalent heterodimer that includes an intact antigen-binding site. The 
enzyme pepsin is able to cleave IgG molecules to provide several fragments, including 

15 the "F(ab')2 " fragment which comprises both antigen-binding sites. An "Fv" fragment 
can be produced by preferential proteolytic cleavage of an IgM, and on rare occasions 
IgG or IgA immunoglobulin molecule. Fv fragments are, however, more commonly 
derived using recombinant techniques known in the art. The Fv fragment includes a 
non-covalent Vh-Vl heterodimer including an antigen-binding site which retains much 

20 of the antigen recognition and binding capabilities of the native antibody molecule. 
Inbar et al. (1972) Proc. Nat Acad. Sci. USA 69:2659-2662; Hochman et al. (1976) 
Biochem 15:2706-2710; and Ehrlich et al. (1980) Biochem 19:4091-4096. 

A single chain Fv ("sFv") polypeptide is a covalently linked Vh::Vl 
heterodimer which is expressed from a gene fusion including Vh- and Vt-encoding 

25 genes linked by a peptide-encoding linker. Huston et al. (1988) Proc. Nat. Acad. Sci. 
USA 85(16):5879-5883. A number of methods have been described to discern chemical 
structures for converting the naturally aggregated— but chemically separated— light and 
heavy polypeptide chains from an antibody V region into an sFv molecule which will 
fold into a three dimensional structure substantially similar to the structure of an 
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antigen-bindmg site. See, e.g., U.S. Pat. Nos. 5,091.513 and 5,132.405. to Huston et al.: 
and II. S. Pat. No. 4,946,778, to Ladner et al. 

Each of the above-described molecules includes a heaw chain and a 
light chain CDR set. respectively interposed between a hea\7 chain and a light chain FR 
5 set which provide support to the CDRS and define the spatial relationship of the CDRs 
relative to each other. As used herein, the term "CDR set" refers to the three 
hypervariable regions of a heaw or light chain V region. Proceeding from the N- 
temiinus of a heaw or light chain, these regions are denoted as "CDRl," "CDR2," and 
"CDR3" respectively. An antigen-binding site, therefore, includes six CDRs, 
10 comprising the CDR set from each of a heavy and a light chain V region. A polypeptide 
comprising a single CDR, {e.g., a CDRl, CDR2 or CDR3) is referred to herein as a 
"molecular recognition unit." Crystallographic analysis of a number of antigen-antibody 
complexes has demonstrated that the amino acid residues of CDRs fom extensive 
contact with bound antigen, wherein the most extensive antigen contact is with the 
15 heavy chain CDR3. Thus, the molecular recognition units are primarily responsible for 
the specificity of an antigen-binding site. 

As used herein, the term "FR set" refers to the four flanking amino acid 
sequences which frame the CDRs of a CDR set of a hea\^ or light chain V region. 
Some FR residues may contact bound antigen; however, FRs are primarily responsible 
20 lor folding the V region into the antigen-binding site, particularly the FR residues 
directly adjacent to the CDRS. Within FRs, certain amino residues and certain structural 
features are veiy highly conserved. In this regard, all V region sequences contain an 
internal disulfide loop of around 90 ajuino acid residues. When the V regions fold into a 
binding-site, the CDRs are displayed as projecting loop motifs which form an antigen- 
25 binding surface. It is generally recognized that there are conserved structural regions of 
FRs which influence the folded shape of the CDR loops into certain "canonical" 
structures-regardless of the precise CDR amino acid sequence. Further, certain FR 
residues are known to participate in non-covalent interdomain contacts which stabilize 
the interaction of the antibody heaw and light chains. 
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A number of "humanized" antibody molecules comprising an antigen- 
binding site derived from a non-human immunoglobulin have been described, including 
chimeric antibodies having rodent V regions and their associated CDRs fused to human 
constant domains (Winter et al. (1991) Nature 349:293-299; Lobuglio et aL (1989) 

5 Proc. Nat. Acad. Sci. USA 86:4220-4224; Shaw et al. (1987) J Immunol. 138:4534- 
4538; and Brown et al. (1987) Cancer Res. 47:3577-3583), rodent CDRs grafted into a 
human supporting FR prior to fusion with an appropriate human antibody constant 
domain (Riechmann et al. (1988) Nature 332:323-327; Verhoeyen et al. (1988) Science 
239:1534-1536; and Jones et al. (1986) Nature 321:522-525), and rodent CDRs 

10 supported by recombinantly veneered rodent FRs (European Patent Publication No. 
519,596, published Dec. 23, 1992). These "humanized" molecules are designed to 
minimize unwanted immunological response toward rodent antihuman antibody 
molecules which limits the duration and effectiveness of therapeutic applications of 
those moieties in human recipients. 

1 5 As used herein, the terms "veneered FRs" and "recombinantly veneered 

FRs" refer to the selective replacement of FR residues from, e,g,, a rodent heavy or light 
chain V region, with human FR residues in order to provide a xenogeneic molecule 
comprising an antigen-binding site which retains substantially all of the native FR 
polypeptide folding structure. Veneering techniques are based on the understanding that 

20 the ligand binding characteristics of an antigen-binding site are determined primarily by 
the structure and relative disposition of the heavy and light chain CDR sets within the 
antigen-binding surface. Davies et al. (1990) Ann. Rev. Biochem. 59:439-473. Thus, 
antigen binding specificity can be preserved in a humanized antibody only wherein the 
CDR structures, their interaction with each other, and their interaction with the rest of 

25 the V region domains are carefully maintained. By using veneering techniques, exterior 
{e,g,, solvent-accessible) FR residues which are readily encountered by the immune 
system are selectively replaced with human residues to provide a hybrid molecule that 
comprises either a weakly immunogenic, or substantially non-inmiunogenic veneered 
surface. 
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The process of veneering makes use of the a\ aiJab]e sequence data for 
human antibody variable domains compiled by Kabat et ah, in Sequences of Proteins of 
Immunological Interest. 4th ed., (U.S. Dept. of Health and Human Ser^•ices, U.S. 
Government Printing Office, 1987), updates to the Kabat database, and other accessible 
5 U.S. and foreign databases (both nucleic acid and protein). Solvent accessibilities of V 
region amino acids can be deduced from the known tluee-dimensional structure for 
human and murine antibody fragments. There are tvvo general steps in veneering a 
murine antigen-binding site. Initially, the FRs of the variable domains of an antibody 
molecule of interest are compared with corresponding FR sequences of human variable 
10 domains obtained from the above-identified sources. The most homologous human V 
regions are then compared residue by residue to corresponding murine amino acids. The 
residues in the murine FR which differ from the human countei-part are replaced by the 
residues present in the human moiety using recombinant techniques well known in the 
art. Residue switching is only canned out with moieties w^hich are at least partially 
1 5 exposed (solvent accessible), and care is exercised in the replacement of amino acid 
residues which may have a significant effect on the tertiary structure of V region 
domains, such as proline, glycine and charged amino acids. 

In this manner, the resultant "veneered" murine antigen-binding sites are 
thus designed to retain the murine CDR residues, the residues substantially adjacent to 
20 the CDRs, the residues identified as buried or mostly buried (solvent inaccessible), the 
residues believed to participate in non-covalent (e.g,, electrostatic and hydrophobic) 
contacts between heavy and light chain domains, and the residues from conserved 
structural regions of the FRs which are believed to influence the "canonical" tertiary 
structures of the CDR loops. These design criteria are then used to prepai e recombinant 
25 nucleotide sequences w^hich combine the CDRs of both the hea^^ and light chain of a 
murine antigen-binding site into human-appearing FRs that can be used to transfect 
mammalian cells for the expression of recombinant human antibodies which exhibit the 
antigen specificity of the murine antibody molecule. 

In another embodiment of the invention, monoclonal antibodies of the 
30 present invention may be coupled to one or more therapeutic agents. Suitable agents in 
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this regard include radionuclides, differentiation inducers, dnigs, toxins, and derivatives 
thereof- Preferred radionuclides include ^^Y, ^^^I, ^^^I, ^^'l, ^^^Re, ^^^Re, ^'^At, and 
^''^Bi. Preferred drugs include methotrexate, and pyrimidine and purine analogs. 
Preferred differentiation inducers include phorbol esters and butyric acid. Preferred 
5 toxins include ricin, abrin, diptheria toxin, cholera toxin, gelonin, Pseudomonas 
exotoxin. Shigella toxin, and pokeweed antiviral protein. 

A therapeutic agent may be coupled {e.g,, covalently bonded) to a 
suitable monoclonal antibody either directly or indirectly (e.g., via a linker group). A 
direct reaction between an agent and an antibody is possible when each possesses a 

1 0 substituent capable of reacting with the other. For example, a nucleophilic group, such 
as an amino or sulfhydryl group, on one may be capable of reacting with a carbonyl- 
containing group, such as an anhydride or an acid halide, or with an alkyl group 
containing a good leaving group {e.g., a halide) on the other. 

Alternatively, it may be desirable to couple a therapeutic agent and an 

15 antibody via a linker group. A linker group can function as a spacer to distance an 
antibody from an agent in order to avoid interference with binding capabilities. A linker 
group can also serve to increase the chemical reactivity of a substituent on an agent or 
an antibody, and thus increase the coupling efficiency. An increase in chemical 
reactivity may also facilitate the use of agents, or functional groups on agents, which 

20 otherwise would not be possible. 

It v^ll be evident to those skilled in the art that a variety of bifiinctional 
or polyfanctional reagents, both homo- and hetero-functional (such as those described in 
the catalog of the Pierce Chemical Co., Rockford, IL), may be employed as the linker 
group. Coupling may be effected, for example, through amino groups, carboxyl groups, 

25 sulfhydryl groups or oxidized carbohydrate residues. There are numerous references 
describing such methodology, ^.g., U.S. Patent No. 4,671,958, to Rodwell et al. 

Where a therapeutic agent is more potent when free from the antibody 
portion of the immunoconjugates of the present invention, it may be desirable to use a 
linker group which is cleavable during or upon internalization into a cell A number of 

30 different cleavable linker groups have been described. The mechanisms for the 
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intracellular release of an agent from these linker groups include cleavage by reduction 
of a disulfide bond {e.g., U.S. Patent No. 4.489,710, to Spitler), by irradiation of a 
photolabile bond (e.g., U.S. Patent No. 4,625,014, to Senter et al.), by hydrolysis of 
derivatized amino acid side chains (e.g., U.S. Patent No. 4,638,045, to Kohn et al.), by 
5 serum complement-mediated hydrolysis (e.g., U.S. Patent No. 4,671 ,958, to Rodwell 
et al.), and acid-catalyzed hydrolysis (e.g., U.S. Patent No. 4,569,789, to Blattler et ah). 

It may be desirable to couple more than one agent to an antibody. In one 
embodiment, multiple molecules of an agent are coupled to one antibody molecule. In 
another embodiment, more than one type of agent may be coupled to one antibody. 

10 Regardless of the paiticular embodiment, immunoconjugates with more than one agent 
may be prepared in a variet>^ of ways. For example, more than one agent may be 
coupled directly to an antibody molecule, or linkers that provide multiple sites for 
attacliment can be used. Altematively, a carrier can be used. 

A carrier may bear the agents in a variety of ways, including covalent 

15 bonding either directly or via a linker group. Suitable carriers include proteins such as 
albumins {e.g., U.S. Patent No. 4,507,234, to Kato et al.), peptides and polysaccharides 
such as aminodextran (e.g., U.S. Patent No. 4,699,784, to Shih et ah). A cairier may 
also bear an agent by noncovalent bonding or by encapsulation, such as within a 
liposome vesicle (e.g,, U.S. Patent Nos. 4,429,008 and 4,873,088). Carriers specific for 

20 radionuclide agents include radiohalogenated small molecules and chelating 
compounds. For example, U.S. Patent No. 4,735,792 discloses representative 
radiohalogenated small molecules and their synthesis. A radionuclide chelate may be 
formed from chelating compounds that include those containing nitrogen and sulfur 
atoms as the donor atoms for binding the metal, or metal oxide, radionuclide. For 

25 example, U.S. Patent No. 4,673,562, to Davison et al. discloses representative chelating 
compounds and their synthesis. 



1 Cell Compositions 

The present invention, in another aspect, provides T cells specific for a 
tumor polypeptide disclosed herein, or for a variant or derivative thereof. Such cells 
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may generally be prepared /;? vitro or ex vivo, using standard procedures. For example, 
T cells may be isolated from bone niairow, peripheral blood, or a fraction of bone 
marrow or peripheral blood of a patient, using a commercially available cell separation 
system, such as the Isolex'^'^ System, available from Nexell Therapeutics, Inc. (Irvine, 
5 CA; see also U.S. Patent No. 5,240,856; U.S. Patent No. 5,215,926; WO 89/06280; WO 
91/16116 and WO 92/07243). Alternatively, T cells may be derived from related or 
unrelated humans, non-human mammals, cell lines or cultures. 

T cells may be stimulated with a polypeptide, polynucleotide encoding a 
polypeptide anJ/or an antigen presenting cell (APC) that expresses such a polypeptide. 

10 Such siimulalion is perlbrmed under conditions and for a time sufficient to permit the 
generation of! cells that are specific for the polypeptide of interest. Preferably, a tumor 
polypeptide or poN nucleotide of the invention is present within a delivery vehicle, such 
as a microsphere, to facilitate the generation of specific T cells. 

T cells are considered to be specific for a polypeptide of the present 

15 invention if the T cells specifically proliferate, secrete cytokines or kill target cells 
coated with the polypeptide or expressing a gene encoding the polypeptide. T cell 
specificit}^ may be evaluated using any of a variety of standard tecliniques. For 
example, within a chromium release assay or proliferation assay, a stimulation index of 
more than two fold increase in lysis and/or proliferation, compared to negative controls, 

20 indicates T cell specificity. Such assays may be performed, for example, as described in 
Chen et ah, Cancer Res. 5¥:I065-1070, 1994. Alternatively, detection of the 
proliferation of T cells may be accomplished by a variety of known techniques. For 
example, T cell proliferation can be detected by measuring an increased rate of DNA 
synthesis (e.g., by pulse-labeling cultures of T cells with tritiated thymidine and 

25 measuring the amount of tritiated thymidine incorporated into DNA). Contact with a 
tumor polypeptide (100 ng/ml - 100 jig/ml, preferably 200 ng/ml - 25 |ig/ml) for 3 - 7 
days will typically result in at least a two fold increase in proliferation of the T cells. 
Contact as described above for 2-3 hours should result in activation of the T cells, as 
measured using standard cytokine assays in which a two fold increase in the level of 

30 cytokine release (e.g., TNF or IFN-y) is indicative of T cell activation (see Coligan et 
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aJ-, CuiTent Protocols in Immunology, vol. 1, Wiley Interscience (Greene 1998)). T 
cells that have been activated in response to a tumor polypeptide, polynucleotide or 
polypeptide-expressing APC may be CD4* and/or CDS*. Tumor polypeptide-specific T 
cells may be expanded using standard techniques. Within preferred embodiments, the T 
5 cells are derived from a patient, a related donor or an unrelated donor, and are 
administered to the patient following stimulation and expansion. 

For therapeutic purposes, CD4'^ or CD8"^ T cells that proliferate in 
response to a tumor polypeptide, polynucleotide or APC can be expanded in number 
either in vitro or in vivo. Proliferation of such T cells in vitro may be accomplished in a 

10 v ariety of ways. For example, the T cells can be re-exposed to a tumor polypeptide, or a 
short peptide corresponding to an immunogenic portion of such a polypeptide, with or 
without the addition of T cell growlh factors, such as interleukin-2. and^or stimulator 
cells that synthesize a tumor polypeptide. Alternatively, one or more T cells that 
proliferate in the presence of the tumor polypeptide can be expanded in number by 

15 cloning. Methods for cloning cells are well known in the art, and include limiting 
dilution. 

Ph armaceutical Compositions 

In additional embodiments, the present invention concerns foixnulation 
of one or more of the polynucleotide, polypeptide, T-cell and/or antibody compositions 

20 disclosed herein in phaiTnaceutically-acceptable carriers for administration to a cell or 
an animal, either alone, or in combination with one or more other modalities of therapy. 

It will be understood that, if desired, a composition as disclosed herein 
ma\ be administered in combination with other agents as well, such as, e.g., other 
proteins or polypeptides or various pharmaceutically-active agents. In fact, there is 

25 virtually no limit to other components that may also be included, given that the 
additional agents do not cause a significant adverse effect upon contact with the target 
cells or host tissues. The compositions may thus be delivered along with various other 
agents as required in the particular instance. Such compositions may be purified from 
host cells or other biological sources, or alternatively may be chemically synthesized as 
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described herein. Likewise, such compositions may further comprise substituted or 
derivatized RNA or DNA compositions. 

Therefore, in another aspect of the present invention, pharmaceutical 
compositions are provided comprising one or more of the polynucleotide, polypeptide, 
5 antibody, and/or T-cell compositions described herein in combination with a 
physiologically acceptable canier. In certain preferred embodiments, the 
pharmaceutical compositions of the invention comprise immunogenic polynucleotide 
and/or polypeptide compositions of the invention for use in prophylactic and theraputic 
vaccine applications. Vaccine prepaiation is generally described in, for example, M.F, 

10 Powell and M.J. Newman, eds., "Vaccine Design (the subunit and adjuvant approach)," 
Plenum Press (NY, 1995). Generally, such compositions will comprise one or more 
polynucleotide and/or polypeptide compositions of the present invention in combination 
with one or more immunostimulants. 

It will be apparent that any of the pharmaceutical compositions described 

15 herein can contain pharmaceutically acceptable salts of the polynucleotides and 
polypeptides of the invention. Such salts can be prepared, for example, from 
pharmaceutically acceptable non-toxic bases, including organic bases (e.g., salts of 
primary, secondary and tertiary amines and basic amino acids) and inorganic bases (e.g., 
sodium, potassium, lithium, ammonium, calcium and magnesium salts). 

20 In another embodiment, illustrative immunogenic compositions, e,g.^ 

vaccine compositions, of the present invention comprise DNA encoding one or more of 
the polypeptides as described above, such that the polypeptide is generated in situ. As 
noted above, the polynucleotide may be administered within any of a variety of delivery 
systems known to those of ordinary skill in the art. Indeed, numerous gene delivery 

25 techniques are well known in the art, such as those described by Rolland, Crit. Rev, 
Therap. Drug Carrier Systems 75:143-198, 1998, and references cited therein. 
Appropriate polynucleotide expression systems will, of course, contain the necessary 
regulatory DNA regulatory sequences for expression in a patient (such as a suitable 
promoter and terminating signal). Alternatively, bacterial delivery systems may involve 
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the administration of a bacterium (such as Bacillus-Calmette-Guerhn) that expresses an 
immunogenic portion of the polypeptide on its cell surface or secretes such an epitope. 

Therefore, in certain embodiments, polynucleotides encoding 
immunogenic polypeptides described herein are introduced into suitable mammalian 
5 host cells for expression using any of a number of known viral-based systems. In one 
illustrative embodiment, retroviruses provide a convenient and effective platform for 
gene de]iver>' systems. A selected nucleotide sequence encoding a polypeptide of the 
present invention can be inserted into a vector and packaged in retroviral particles using 
techniques known in the art. The recombinant virus can then be isolated and delivered 
10 to a subject. A number of illustrative retroviral systems have been described {e.g., U.S. 
Pat. No. 5,219,740; Miller and Rosman (1989) BioTechniques 7:980-990; Miller, A. D. 
(1990) Human Gene Therapy 1:5-14; Scarpa et af (1991) Virology 180:849-852; Bums 
et al. (1993) Proc. Natl. Acad. Sci. USA 90:8033-8037; and Boris-Lawrie and Temin 
(1993) Cur. Opin. Genet. Develop. 3:102-109. 
^ 5 In addition, a number of illustrative adenovirus-based systems have also 

been described. Unlike retroviruses which integrate into the host genome, adenoviruses 
persist extrachromosomally thus minimizing the risks associated with insertional 
mutagenesis (Haj-Ahmad and Graham (1986) J. Virol. 57:267-274; Bett et al. (1993) J. 
Virol. 67:591 1-5921 ; Mittereder et al. (1994) Human Gene Therapy 5:717-729; Seth et 
20 al. (1994) J. Virol. 68:933-940; Bair et ah (1994) Gene Therapy 1:51-58; Berkner, K. L. 
(1988) BioTechniques 6:616-629; and Rich et al. (1993) Human Gene Therapy 4:461- 
476). 

Various adeno-associated virus (AAV) vector systems have also been 
developed for polynucleotide delivery. AAV vectors can be readily constructed using 
25 techniques well known in the art. See, e.^., U.S. Pat. Nos. 5,173,414 and 5,139,941; 
International Publication Nos. WO 92/01070 and WO 93/03769: Lebkowski et al. 
(1988) Molec. Cell. Biol. 8:3988-3996; Vincent et al. (1990) Vaccines 90 (Cold Spring 
Hariw Laboratory Press); Carter, B. J. (1992) Current Opinion in Biotechnology 3:533- 
539; Muzyczka, N. (1992) Current Topics in Microbiol, and Immunol. 158:97-129; 
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Kotin, R. M. (1994) Human Gene Therapy 5:793-801; Shelling and Smith (1994) Gene 
Therapy 1:165-169; and Zhou et aL(1994) J. Exp. Med. 179:1867-1875. 

Additional viral vectors useful for delivering the polynucleotides 
encoding polypeptides of the present invention by gene transfer include those derived 
5 from the pox family of viruses, such as vaccinia virus and avian poxvirus. By way of 
example, vaccinia virus recombinants expressing the novel molecules can be 
constructed as follows. The DNA encoding a polypeptide is first inserted into an 
appropriate vector so that it is adjacent to a vaccinia promoter and flanking vaccinia 
DNA sequences, such as the sequence encoding thymidine kinase (TK). This vector is 

10 then used to transfect cells which are simultaneously infected with vaccinia. 
Homologous recombination serves to insert the vaccinia promoter plus the gene 
encoding the polypeptide of interest into the viral genome. The resulting TK.sup.(-) 
recombinant can be selected by culturing the cells in the presence of 5- 
bromodeoxyuridine and picking viral plaques resistant thereto. 

1 5 A vaccinia-based infection/transfection system can be conveniently used 

to provide for inducible, transient expression or coexpression of one or more 
polypeptides described herein in host cells of an organism. In this particular system, 
cells are first infected in vitro with a vaccinia virus recombinant that encodes the 
bacteriophage T7 RNA polymerase. This polymerase displays exquisite specificity in 

20 that it only transcribes templates bearing T7 promoters. Following infection, cells are 
transfected with the polynucleotide or polynucleotides of interest, driven by a T7 
promoter. The polymerase expressed in the cytoplasm from the vaccinia virus 
recombinant transcribes the transfected DNA into RNA which is then translated into 
polypeptide by the host translational machinery. The method provides for high level, 

25 transient, cytoplasmic production of large quantities of RNA and its translation 
products. See, eg., Elroy-Stein and Moss, Proc. Natl. Acad. Sci. USA (1990) 87:6743- 
6747; Fuerst et al. Proc. Natl. Acad. Sci. USA (1986) 83:8122-8126. 

Ahematively, avipoxviruses, such as the fowlpox and canarypox viruses, 
can also be used to deliver the coding sequences of interest. Recombinant avipox 

30 viruses, expressing immunogens from mammalian pathogens, are known to confer 
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protective immunity when administered to non-avian species. The use of an Avipox 
vector is particularly desirable in human and other mammalian species since members 
of the Avipox genus can only productively replicate in susceptible avian species and 
therefore are not infective in mammalian cells. Methods for producing recombinant 
> Avipoxviruses are known in the art and employ genetic recombination, as described 
above with respect to the production of vaccinia vinises. See, e.g., WO 91/12882; WO 
89/03429; and WO 92/03545, 

Any of a number of alphavirus vectors can also be used for delivery of 
polynucleotide compositions of the present invention, such as those vectors described in 
10 U.S. Patent Nos. 5,843,723: 6,015,686; 6,008,035 and 6.015,694. Certain vectors based 
on Venezuelan Equine Encephalitis (VEE) can also be used, illustrative examples of 
which can be found in U.S. Patent Nos. 5,505,947 and 5,643.576. 

Moreover, molecular conjugate vectors, such as the adenovirus chimeric 
vectors described in Michael et al. J. Biol. Chem. (1993) 268:6866-6869 and Wagner et 
15 ah Proc. Natl. Acad. Sci. USA (1992) 89:6099-6103, can also be used for gene delivery 
under the invention. 

Additional illustrative information on these and other known viral-based 
delivery systems can be found, for example, in Fisher-Hoch et ah, Proc. Natl. Acad. Sci. 
USA 5(5:317-321, 1989; Flexner et ah, Ann. N.Y. Acad Sci. 56^:86-103, 1989; Flexner 
20 et ah. Vaccine 5:17-21, 1990; U.S. Patent Nos. 4,603 J 12, 4,769,330, and 5,017,487; 
WO 89/01973; U.S. Patent No. 4,777,127; GB 2,200,651; EP 0,345,242; 
WO 91/02805; Berkner, Biotechniques 6:616-627, 1988; Rosenfeld et ah. Science 
252:431-434, 1991; Kolls et al., Proc. Natl. Acad, Sci. USA 5^7:215-219, 1994; 
Kass-Eisler et al., Proc. Natl Acad Sci. USA P0:1 1498-1 1 502, 1993; Guzman et aL, 
25 Circulation 55:2838-2848, 1993; and Guzman et al., Cir. Res. 75:1202-1207, 1993. 

In certain embodiments, a polynucleotide may be integrated into the 
genome of a target cell. This integration may be in a specific location and orientation 
via homologous recombination (gene replacement) or it may be integrated in a random, 
non-specific location (gene augmentation). In yet further embodiments, the 
polynucleotide may be stably maintained in the cell as a separate, episomal segment of 
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DNA. Such polynucleotide segments or "episomes" encode sequences sufficient to 
permit maintenance ajid replication independent of or in synchronization with the host 
cell cycle. The manner in which the expression construct is delivered to a cell and 
where in the cell the polynucleotide remains is dependent on the type of expression 
5 construct employed. 

In another embodiment of the invention, a polynucleotide is 
administered/delivered as ''naked" DNA, for example as described in Ulmer et al.. 
Science 259:1745-1749, 1993 and reviewed by Cohen, Science 259:1691-1692, 1993. 
The uptake of naked DNA may be increased by coating the DNA onto biodegradable 

10 beads, which are efficiently transported into the cells. 

In still another embodiment, a composition of the present invention can 
be delivered via a particle bombardment approach, many of which have been described. 
In one illustrative example, gas-driven particle acceleration can be achieved with 
devices such as those manufactured by Powderject Pharmaceuticals PLC (Oxford, UK) 

15 and Powderject Vaccines Inc. (Madison, WI), some examples of which are described in 
U.S. Patent Nos. 5,846,796; 6,010,478; 5,865,796; 5,584,807; and EP Patent No. 0500 
799. This approach offers a needle-free delivery approach wherein a dry powder 
formulation of microscopic particles, such as polynucleotide or polypeptide particles, 
are accelerated to high speed within a helium gas jet generated by a hand held device, 

20 propelling the particles into a target tissue of interest. 

In a related embodiment, other devices and methods that may be useful 
for gas-driven needle-less injection of compositions of the present invention include 
those provided by Bioject, Inc. (Portland, OR), some examples of which are described 
in U.S. Patent Nos. 4,790,824; 5,064,413; 5,312,335; 5,383,851; 5,399,163; 5,520,639 

25 and 5,993,412. 

According to another embodiment, the pharmaceutical compositions 
described herein will comprise one or more immunostimulants in addition to the 
immunogenic polynucleotide, polypeptide, antibody, T-cell and/or AFC compositions 
of this invention. An immunostimulant refers to essentially any substance that enhances 
30 or potentiates an immune response (antibody and/or cell-mediated) to an exogenous 
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antigen. One prefened type of immunostinnilant comprises an adju\ant. Many 
adjuvants contain a substance designed to protect the antigen from rapid catabolism, 
such as aluminum hydroxide or mineral oiK and a stimulator of immune responses, such 
as lipid A, Bortadella pertussis or Mycobacterium tuberculosis derived proteins. 
5 Certain adjuvants are commercially available as, for example, Freund's Incomplete 
Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, MI); Merck Adjuvant 
65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKIine Beechani, Pliiladelphia, 
PA); aluminum salts such as aluminum hydroxide gel (alum) or aluminum phosphate; 
salts of calcium, iron or zinc; an insoluble suspension of acylated tyrosine; acylated 
10 sugars; cationically or anionically derivatized polysaccharides; polyphosphazenes; 
biodegradable microspheres; monophosphoryl lipid A and quil A. Cytokines, such as 
CjM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be used as 
adjuvants. 

Within certain embodiments of the invention, the adjuvant composition 

15 is preferably one that induces an immune response predominantly of the Thl type. High 
levels of Thl-type cytokines {e.g., IFN-y, TNFa, lL-2 and IL-12) tend to favor the 
induction of cell mediated immune responses to an administered antigen. In contrast, 
high levels of Th2-t>^pe cytokines {e.g., IL-4, IL-5, IL-6 and IL-10) tend to favor the 
induction of humoral immune responses. Following application of a vaccine as 

20 provided herein, a patient will support an immune response that includes Thl - and Th2- 
type responses. Within a preferred embodiment, in which a response is predominantly 
Thl-type, the level of Thl-type cytokines will increase to a greater extent than the level 
of Th2-type cytokines. The levels of these cytokines may be readily assessed using 
standard assays. For a review of the families of cytokines, see Mosmann and Coffman, 

25 Aim. Rev. hmmmol. 7:145-173, 1989. 

Certain preferred adjuvants for eliciting a predominantly Thl-type 
response include, for example, a combination of monophosphoi7l lipid A, preferably 3- 
de-O-acylated monophosphoryl lipid A, together with an aluminum salt. MPL^ 
adjuvants are available from Corixa Corporation (Seattle, WA; see, for example, US 

^0 Patent Nos. 4,436,727; 4.877,611; 4,866,034 and 4,912,094). CpG-containing 
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oligonucleotides (in which the CpG dinucleotide is unmethylated) also induce a 
predominantly Thl response. Such oligonucleotides are well known and are described, 
for example, in WO 96/02555, WO 99/33488 and U.S. Patent Nos. 6,008,200 and 
5,856,462. Inimunostimulatory DNA sequences are also described, for example, by 
5 Sato et aL, Science 275:352, 1996. Another prefeired adjuvant comprises a saponin, 
such as Quil A, or derivatives thereof, including QS21 and QS7 (Aquila 
Biopharmaceuticals Inc., Framingham, MA); Escin; Digitonin; or Gypsophila or 
Chenopodium quinoa saponins . Other preferred formulations include more than one 
saponin in the adjuvant combinations of the present invention, for example 

10 combinations of at least two of the following group comprising QS21, QS7, Quil A, (i- 
escin, or digitonin. 

Alternatively the saponin formulations may be combined with vaccine 
vehicles composed of chitosan or other polycationic polymers, polylactide and 
polylactide-co-glycolide particles, poly-N-acetyl glucosamine-based polymer matrix, 

15 particles composed of polysaccharides or chemically modified polysaccharides, 
liposomes and lipid-based particles, particles composed of glycerol monoesters, etc. The 
saponins may also be formulated in the presence of cholesterol to foim particulate 
structures such as liposomes or ISCOMs. Furthermore, the saponins may be formulated 
together with a polyoxyethylene ether or ester, in either a non-particulate solution or 

20 suspension, or in a particulate structure such as a paucilamelar liposome or ISCOM. The 
saponins may also be formulated with excipients such as Carbopol^ to increase 
viscosity, or may be formulated in a dry powder form with a powder excipient such as 
lactose. 

In one preferred embodiment, the adjuvant system includes the 
25 combination of a monophosphoryl lipid A and a saponin derivative, such as the 
combination of QS21 and 3D-MPL® adjuvant, as described in WO 94/00153, or a less 
reactogenic composition where the QS21 is quenched with cholesterol, as described in 
WO 96/33739. Other preferred formulations comprise an oil-in-water emulsion and 
tocopherol. Another particularly preferred adjuvant formulation employing QS21, 3D- 
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MPL' adjuvant and tocopherol in an oil-in-water emulsion is described in WO 
M5/17210. 

/Vnother enhanced adjuvant system involves the combination of a CpG- 
containing oligonucleotide and a saponin derivative particularly the combination of 
5 CpG and QS21 is disclosed in WO 00/09159. Preferably the formulation additionally 
comprises an oil in water emulsion and tocopherol. 

Additional illustrative adjuvants for use in the pharmaceutical 
compositions of the invention include Montanide ISA 720 (Seppic, France), SAF 
(Chiron, California, United States), ISCOMS (CSL), MF-59 (Chiron), the SB AS series 
0 of adju\anis {ca: . SBAS-2 or SBAS-4, available from SmithKHne Beecham, Rixensart, 
Belgium). Detox (Hnhanzyn^'; Corixa, Hamihon, MT), RC-529 (Corixa, Hamilton, MT) 
and other aniinoalkyl glucosaminide 4-phosphates (AGPs), such as those described in 
pending l^S. Patent Application Serial Nos. 08/853,826 and 09/074,720, the disclosures 
of which are incorporated herein by reference in their entireties, and polyoxyethylene 
5 ether adjuvants such as those described in WO 99/52549A1. 

Other preferred adjuvants include adjuvant molecules of the general 

formula 

(1): HO(CH2CH20)n-A-R, 
wherein, n is 1 -50, A is a bond or -C(0)-, R is C1.50 alkyl or Phenyl Cj^so alkyl. 

0 One embodiment of the present invention consists of a vaccine 

formulation comprising a polyoxyethylene ether of general formula (1)» wherein ;? is 
between 1 and 50, preferably 4-24, most preferably 9; the R component is Ci^o, 
preferably CrCio alkyl and most preferably C12 alkyl, and A is a bond. The 
concentration of the polyoxyethylene ethers should be in the range 0.1-20%, preferably 

5 from 0-1-10%, and most preferably in the range 0.1-1%. Preferred polyoxyethylene 
ethers are selected from the following group: polyoxyethylene-9-lauryl ether, 
polyoxyethylene-9-steoryI ether, polyoxyethy]ene-8-steor>i ether, polyoxyethylene-4- 
lauryi ether, polyoxyethylene-35-lauryl ether, and polyoxyethylene-23-laur>d ether. 
Polyoxyethylene ethers such as polyoxyethylene laur>i ether are described in the Merck 

0 index (12"' edition: entr>' 7717). These adjuvant molecules are described in WO 
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99/52549. The polyoxyethylene ether according to the general formula (I) above may, if 
desired, be combined with another adjuvant. For example, a preferred adjuvant 
combination is preferably with CpG as described in the pending UK patent application 
GB 9820956.2. 

5 According to another embodiment of this invention, an immunogenic 

composition described herein is deHvered to a host via antigen presenting cells (APCs), 
such as dendritic cells, macrophages, B cells, monocytes and other cells that may be 
engineered to be efficient APCs. Such cells may, but need not, be genetically modified 
to increase the capacity for presenting the antigen, to improve activation and/or 
10 maintenance of the T cell response, to have anti-tumor effects per se and/or to be 
immunologically compatible with the receiver {i.e., matched HLA haplotype). APCs 
may generally be isolated fi-om any of a variety of biological fluids and organs, 
including tumor and peritumoral tissues, and may be autologous, allogeneic, syngeneic 
or xenogeneic cells. 

15 Certain preferred embodiments of the present invention use dendritic 

cells or progenitors thereof as antigen-presenting cells. Dendritic cells are highly potent 
APCs (Banchereau and Steinman, Nature 392:245-251, 1998) and have been sho^vn to 
be effective as a physiological adjuvant for eliciting prophylactic or therapeutic 
antitimior immunity {see Timmerman and Levy, Rev. Med. 50:507-529, 1999). In 

20 general, dendritic cells may be identified based on their typical shape (stellate m situ^ 
with marked cytoplasmic processes (dendrites) visible in vitro), their ability to take up, 
process and present antigens with high efficiency and their ability to activate naive T 
cell responses. Dendritic cells may, of course, be engineered to express specific cell- 
surface receptors or ligands that aie not commonly found on dendritic cells in vivo or ex 

25 vivo, and such modified dendritic cells are contemplated by the present invention. As 
an alternative to dendritic cells, secreted vesicles antigen-loaded dendritic cells (called 
exosomes) may be used within a vaccine {see Zitvogel et al., Nature Med ^:594-600, 
1998). 

Dendritic cells and progenitors may be obtained from peripheral blood, 
30 bone marrow, tumor-infiltrating cells, peritumoral tissues-infiltrating cells, lymph 
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nodes, spleen, skin, umbilical cord blood or any other suitable tissue or fluid. For 
example, dendritic cells may be differentiated ex vivo by adding a combination of 
cytokines such as GM-CSF, lL-4, IL-13 and/or FNFa to cultures of monocytes 
harvested from peripheral blood. AUematively. CD34 positive cells harvested from 

5 peripheral blood, umbilical cord blood or bone marrow may be differentiated into 
dendritic cells by adding to the culture medium combinations of GM-CSF, IL-3, TNFa, 
CD40 ligand, LPS, fltS ligand and/or other compound(s) that induce differentiation, 
maturation and proliferation of dendritic cells. 

Dendritic cells are conveniently categorized as "immature" and "mature" 

0 cells, which allows a simple way to discriminate between two well characterized 
phenotypes. However, this nomenclature should not be construed to exclude all 
possible intemiediate stages of differentiation. Immature dendritic cells are 
characterized as APC with a high capacity for antigen uptake and processing, which 
correlates with the high expression of Fey receptor and mannose receptor. The mature 

5 phenot>'pe is typically characterized by a lower expression of these markers, but a high 
expression of cell surface molecules responsible for T cell activation such as class I and 
class II MHC, adhesion molecules (e.g., CD54 and GDI 1) and costimulatory molecules 
(e.g., CD40, CD80, CD86 and 4- IBB). 

APCs may generally be transfected with a polynucleotide of the 

0 invention (or portion or other variant thereof) such that the encoded polypeptide, or an 
immunogenic portion thereof, is expressed on the cell surface. Such transfection may 
take place ex vivo, and a pharmaceutical composition comprising such transfected cells 
may then be used for therapeutic purposes, as described herein. Alternatively, a gene 
deliver>^ vehicle that targets a dendritic or other antigen presenting cell may be 

5 administered to a patient, resulting in transfection that occurs in vivo. In vivo and ex 
vivo transfection of dendritic cells, for example, may generally be perfomied using any 
methods known in the art, such as those described in WO 97/24447, or the gene gun 
approach described by Mahvi et al.. Immunology^ and cell Biology 7J:456-460, 1997. 
Antigen loading of dendritic cells may be achieved by incubating dendritic cells or 

0 progenitor cells with the tumor polypeptide, DNA (naked or within a plasmid vector) or 
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RNA; or with antigen-expressing recombinant bacterium or viruses (e.g., vaccinia, 
fowipox, adenovirus or lentivirus vectors). Prior to loading, the polypeptide may be 
covalently conjugated to an immunological partner that provides T cell help (e.g., a 
carrier molecule). Alternatively, a dendritic cell may be pulsed with a non-conjugated 
5 immunological partner, separately or in the presence of the polypeptide. 

Wliile any suitable carrier knovm to those of ordinary skill in the art may 
be employed in the pharmaceutical compositions of this invention, the type of carrier 
will t\picaliy \'ar\' depending on the mode of administration. Compositions of the 
present in\ ention may be formulated for any appropriate manner of administration, 

10 including for example, topical, oral, nasal, mucosal, intravenous, intracranial, 
intraperitoneal , subcutaneous and intramuscular administration. 

Carriers for use within such pharmaceutical compositions are 
biocompatible, and may also be biodegradable. In certain embodiments, the 
fonnulation preferably provides a relatively constant level of active component release. 

15 In other embodiments, however, a more rapid rate of release immediately upon 
administration may be desired. The formulation of such compositions is well within the 
level of ordinary skill in the art using known techniques. Illustrative carriers useful in 
this regard include microparticles of poly(lactide-co-glycolide), polyacrylate, latex, 
starch, cellulose, dextran and the like. Other illustrative delayed-release carriers 

20 include supramolecular biovectors, which comprise a non-liquid hydrophilic core (e.g., 
a cross-linked polysaccharide or oligosaccharide) and, optionally, an external layer 
comprising an amphiphilic compound, such as a phospholipid (see e.g., U.S. Patent No. 
5,151,254 and PCT applications WO 94/20078, WO/94/23701 and WO 96/06638). The 
amoimt of active compound contained within a sustained release formulation depends 

25 upon the site of implantation, the rate and expected duration of release and the nature of 
the condition to be treated or prevented. 

In another illustrative embodiment, biodegradable microspheres (e.g., 
polylactate polyglycolate) are employed as carriers for the compositions of this 
invention. Suitable biodegradable microspheres are disclosed, for example, in U.S. 

30 Patent Nos. 4,897,268; 5,075,109; 5,928,647; 5,811,128; 5,820,883; 5,853,763; 
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5.814344, 5,407.609 and 5,942,252. Modified hepatitis B core protein carrier systems, 
such as described in WO/99 40934, and relerences cited therein, will also be useful for 
many applications. Another illustrative carrier/delivery system employs a carrier 
comprising particulate-protein complexes, such as those described in U.S. Patent No. 

5 5,928,647, which are capable of inducing a class I-restricted cytotoxic T lymphocyte 
responses in a host. 

The pharmaceutical compositions of the invention will often further 
comprise one or more buffers (e.g., neutral buffered saline or phosphate buffered 
saline), carbohydrates (e.g., glucose, mannose, sucrose or dextrans), mannitol, proteins, 

0 polypeptides or amino acids such as glycine, antioxidants, bacteriostats, chelating 
agents such as EDTA or glutathione, adjuvants (e.g., aluminum hydroxide), solutes that 
render the formulation isotonic, hypotonic or weakly hypertonic with the blood of a 
recipient, suspending agents, thickening agents and/or preservatives. Alternatively, 
compositions of the present invention may be formulated as a lyophilizate. 

5 The pharmaceutical compositions described herein may be presented in 

unit-dose or multi-dose containers, such as sealed ampoules or vials. Such containers 
are typically sealed in such a way to preserve the sterility and stability of the 
tonnulation until use. In general, fonnulations may be stored as suspensions, solutions 
or emulsions in oily or aqueous vehicles. AUematively, a pharmaceutical composition 

0 may be stored in a freeze-dried condition requiring only the addition of a sterile liquid 
carrier immediately prior to use. 

The development of suitable dosing and treatment regimens for using the 
pailicular compositions described herein in a variety of treatment regimens, including 
e.g.. oral, parenteral, intravenous, intranasal, and intramuscular administration and 

5 formulation, is w^ell known in the art, some of which are briefly discussed below for 
general purposes of illustration. 

In certain applications, the pharmaceutical compositions disclosed herein 
may be delivered via oral administration to an animal. As such, these compositions 
may be formulated with an inert diluent or with an assimilable edible carrier, or they 
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may be enclosed in hard- or soft-shell gelatin capsule, or they may be compressed into 
tablets, or they may be incorporated directly with the food of the diet. 

The active compounds may even be incorporated with excipients and 
used in the form of ingestible tablets, buccal tables, troches, capsules, elixirs, 
5 suspensions, syrups, wafers, and the like (see, for example, Mathiowitz ei oL, Nature 
1997 Mar 27;386(6623):4]0-4; Hwang et ah, Crit Rev Ther Drug Carrier Syst 
1998;15(3):243-84; U. S. Patent 5,641,515; U. S. Patent 5,580,579 and U. S. Patent 
5,792,451). Tablets, troches, pills, capsules and the like may also contain any of a 
variety of additional components, for example, a binder, such as gum tragacanth, acacia, 

10 cornstarch, or gelatin; excipients, such as dicalcium phosphate; a disintegrating agent, 
such as com starch, potato starch, alginic acid and the like; a lubricant, such as 
magnesium stearate; and a sweetening agent, such as sucrose, lactose or saccharin may 
be added or a flavoring agent, such as peppermint, oil of wintergreen, or cherry 
flavoring. When the dosage unit form is a capsule, it may contain, in addition to 

1 5 materials of the above type, a liquid carrier. Various other materials may be present as 
coatings or to otherwise modify the physical form of the dosage unit. For instance, 
tablets, pills, or capsules may be coated with shellac, sugar, or both. Of course, any 
material used in preparing any dosage unit form should be pharmaceutically pure and 
substantially non-toxic in the amounts employed. In addition, the active compoimds 

20 may be incorporated into sustained-release preparation and formulations. 

Typically, these formulations will contain at least about 0.1% of the 
active compound or more, although the percentage of the active ingredient(s) may, of 
course, be varied and may conveniently be between about 1 or 2% and about 60% or 
70% or more of the weight or volume of the total formulation. Naturally, the amount of 

25 active compound(s) in each therapeutically useful composition may be prepared is such 
a way that a suitable dosage will be obtained in any given unit dose of the compound. 
Factors such as solubility, bioavailability, biological half-life, route of administration, 
product shelf life, as well as other pharmacological considerations will be contemplated 
by one skilled in the art of preparing such pharmaceutical formulations, and as such, a 

30 variety of dosages and treatment regimens may be desirable. 
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For oral administration, the compositions of the present invention may 
alternatively be incorporated with one or more excipients in the form of a mouthwash, 
dentifrice, buccal tablet, oral spray, or sublingual orally-administered formulation. 
Alternatively, the active ingredient may be incorporated into an oral solution such as 
5 one containing sodium borate, glycerin and potassium bicarbonate, or dispersed in a 
dentifrice, or added in a therapeutically-effective amount to a composition that may 
include water, binders, abrasives, flavoring agents, foaming agents, and humectants. 
Alternatively the compositions may be fashioned into a tablet or solution form that may 
be placed under the tongue or otherwise dissolved in the mouth. 

10 In certain circumstances it will be desirable to deliver the pharmaceutical 

compositions disclosed herein parenterally, intravenously, intramuscularly, or even 
inlraperitoneally. Such approaches are well known to the skilled artisan, some of which 
are further described, for example, in U. S. Patent 5,543,158; U. S. Patent 5,641,515 
and U. S. Patent 5,399,363. In certain embodiments, solutions of the active compounds 

15 as free base or phannacologically acceptable salts may be prepared in water suitably 
mixed with a surfactant, such as hydroxypropylcellulose. Dispersions may also be 
prepared in glycerol, liquid polyethylene glycols, and mixtures thereof and in oils. 
Under ordinary conditions of storage and use, these preparations generally will contain a 
preservative to prevent the growth of microorganisms. 

^-0 Illustrative pharaiaceutical forms suitable for injectable use include 

sterile aqueous solutions or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersions (for example, see U. S. Patent 
5,466,468). In all cases the form must be sterile and must be fluid to the extent tliat 
easy syringability exists. It must be stable under the conditions of manufacture and 

25 storage and must be preser\^ed against the contaminating action of microorganisms, 
such as bacteria and fungi. The carrier can be a solvent or dispersion medimn 
containing, for example, water, ethanol, polyol (e.g., glycerol, propylene glycol, and 
liquid polyethylene glycol, and the like), suitable mixtures thereof, and/or vegetable 
oils. Proper fluidit\^ may be maintained, for example, by the use of a coating, such as 

^0 lecithin, by the maintenance of the required particle size in the case of dispersion and/or 
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by the use of surfactants. The prevention of the action of microorganisms can be 
facilitated by various antibacterial and antifungal agents, for example, parabens, 
chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases, it will be 
preferable to include isotonic agents, for example, sugars or sodium chloride. 
5 Prolonged absorption of the injectable compositions can be brought about by the use in 
the compositions of agents delaying absorption, for example, aluminum monostearate 
and gelatin. 

In one embodiment, for parenteral administration in an aqueous solution, 
the solution should be suitably buffered if necessary and the liquid diluent first rendered 

10 isotonic with sufficient saline or glucose. These particular aqueous solutions are 
especially suitable for intravenous, intramuscular, subcutaneous and intraperitoneal 
administration. In this connection, a sterile aqueous medium that can be employed will 
be known to those of skill in the art in light of the present disclosure. For example, one 
dosage may be dissolved in 1 ml of isotonic NaCI solution and either added to 1000 ml 

1 5 of hypodermoclysis fluid or injected at the proposed site of infusion, (see for example, 
"Remington's Pharmaceutical Sciences" 1 5th Edition, pages 1035-1038 and 1570- 
1580). Some variation in dosage will necessarily occur depending on the condition of 
the subject being treated. Moreover, for human administration, preparations will of 
course preferably meet sterility, pyrogenicity, and the general safety and purity 

20 standards as required by FDA Office of Biologies standards. 

In another embodiment of the invention, the compositions disclosed 
herein may be formulated in a neutral or salt form. Illustrative 
pharmaceutically-acceptable salts include the acid addition salts (formed with the free 
amino groups of the protein) and which are fomied with inorganic acids such as, for 

25 example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, 
tartaric, mandelic, and the like. Salts formed with the free carboxyl groups can also be 
derived from inorganic bases such as, for example, sodium, potassium, ammonivmi, 
calcium, or ferric hydroxides, and such organic bases as isopropylamine, 
trimethylamine, histidine, procaine and the like. Upon formulation, solutions will be 
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administered in a manner compatible with the dosage formulation and in such amount 
as is therapeutically effective. 

The carriers can further comprise any and all solvents, dispersion media, 
vehicles, coatings, diluents, antibacterial and antifungal agents, isotonic and absorption 

5 delaying agents, buffers, carrier solutions, suspensions, colloids, and the like. The use 
of such media and agents for pharmaceutical active substances is well known in the art. 
Except insofar as any conventional media or agent is incompatible with the active 
ingredient, its use in the therapeutic compositions is contemplated. Supplementary 
active ingredients can also be incorporated into the compositions. The phrase 

0 "pharmaceutically-acceptable" refers to molecular entities and compositions that do not 
produce an allergic or similar untoward reaction when administered to a human. 

In certain embodiments, the pharmaceutical compositions may be 
delivered by intranasal sprays, inhalation, and/or other aerosol delivery vehicles. 
Methods for delivering genes, nucleic acids, and peptide compositions directly to the 

5 lungs via nasal aerosol sprays has been described, e.g., in U. S. Patent 5,756,353 and U. 
S. Patent 5,804,212. Likewise, the delivery of drugs using intranasal microparticle 
resins (Takenaga et al, J Controlled Release 1998 Mar 2;52(l-2):81-7) and 
lysophosphatidyl-glycerol compounds (U. S. Patent 5,725,871) are also well-known in 
the pharmaceutical arts. Likewise, illustrative transmucosal drug delivery in the form of 

0 a polytetrafluoroetheylene support matrix is described in U. S. Patent 5,780,045. 

In certain embodiments, liposomes, nanocapsules, microparticles, lipid 
particles, vesicles, and the like, are used for the introduction of the compositions of the 
present invention into suitable host cells/organisms. In particular, the compositions of 
the present invention may be formulated for delivery either encapsulated in a lipid 

5 particle, a liposome, a vesicle, a nanosphere, or a nanoparticle or the Hke. Alternatively, 
compositions of the present invention caji be bound, either covalently or non-covalently, 
to the surface of such cairier vehicles. 

The formation and use of liposome and liposome-like preparations as 
potential drug carriers is generally known to those of skill in the art (see for example, 
0 Lasic, Trends Bioteclmol 1998 Jul;]6(7):307-2] ; Takakura, Nippon Rinsho 1998 



wo 01/51633 



PCT/ISO 1/0 1574 



108 

Mar;56(3):691-5; Chandran el a/., Indian J Exp Biol. 1997 Aug;35(8):801-9; Margalit, 
Crit Rev Ther Drug Carrier Syst. 1995;12(2-3):233-61; U.S. Patent 5,567,434; U.S. 
Patent 5,552,157; U.S. Patent 5,565,213; U.S. Patent 5,738,868 and U.S. Patent 
5,795,587, each specifically incorporated herein by reference in its entirety). 
5 Liposomes have been used successfully with a number of cell types that 

are normally difficult to transfect by other procedures, including T cell suspensions, 
primary hepatocyte cultures and PC 12 cells (Renneisen et ai, J Biol Chem. 1990 Sep 
25;265(27): 16337-42; Muller et al, DNA Cell Biol. 1990 Apr;9(3):221-9). In addition, 
liposomes are free of the DNA length constraints that are typical of viral-based delivery 

10 systems. Liposomes have been used effectively to introduce genes, various drugs, 
radiotherapeutic agents, enzymes, viruses, transcription factors, allosteric effectors and 
the like, into a variety of cultured cell lines and animals. Furthermore, he use of 
liposomes does not appear to be associated w^ith autoimmune responses or \inacceptable 
toxicity after systemic delivery. 

15 In certain embodiments, liposomes are formed from phospholipids that 

are dispersed in an aqueous medium and spontaneously form multilamellar concentric 
bilayer vesicles (also termed multilamellar vesicles (MLVs). 

Alternatively, in other embodiments, the invention provides for 
pharmaceutically-acceptable nanocapsule formulations of the compositions of the 

20 present invention. Nanocapsules can generally entrap compounds in a stable and 
reproducible way (see, for example, Quintanar-Guerrero et aL, Drug Dev Ind Pharm. 
1998 Dec;24(12);l] 13-28). To avoid side effects due to intracellular polymeric 
overloading, such ultrafine particles (sized around 0.1 \xm) may be designed using 
polymers able to be degraded in vivo. Such particles can be made as described, for 

25 example, by Couvreur et al, Crit Rev Ther Drug Carrier Syst. 1988;5(l):l-20; zur 
Muhlen et al, Eur J Pharm Biopharm. 1998 Mar;45(2): 149-55; Zanibaux et al J 
Controlled Release. 1998 Jan 2;50(l-3):31-40; and U. S. Patent 5,145,684. 
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Cancer Therapeutic Methods 

]n further aspects of the present invention, the pharmaceutical 
compositions described herein may be used for the treatment of cancer, particularly for 
the immunotherapy of prostate cancer. Within such methods, the pharmaceutical 

5 compositions described herein are administered to a patient, typically a warm-blooded 
animal, preferably a human. A patient may or may not be afflicted with cancer. 
Accordingly, the above pharmaceutical compositions may be used to prevent the 
development of a cancer or to treat a patient afflicted with a cancer. Pharmaceutical 
compositions and vaccines may be administered either prior to or following surgical 

0 removal of primary tumors and/or treatment such as administration of radiotherapy or 
conventional chemotherapeutic drugs. As discussed above, administration of the 
phannaceutical compositions may be by any suitable method, including administration 
by intravenous, intraperitoneal, intramuscular, subcutaneous, intranasal, intradermal, 
anal, vaginal, topical and oral routes. 

5 Within certain embodiments, immunotherapy may be active 

immunotherapy, in which treatment relies on the in vivo stimulation of the endogenous 
host immune system to react against tumors with the administration of immune 
response-modifying agents (such as polypeptides and polynucleotides as provided 
herein). 

0 Within other embodiments, immunotherapy may be passive 

immunotherapy, in which treatment involves the delivery of agents with established 
tumor-immune reactivity (such as effector cells or antibodies) that can directly or 
indirectly mediate antitumor effects and does not necessarily depend on an intact host 
immune system. Examples of effector cells include T cells as discussed above, T 

5 lymphocytes (such as CT)t cytotoxic T lymphocytes and CD4'~ T-helper tumor- 
infiltrating lymphocytes), killer cells (such as Natural Killer cells and lymphokine- 
activated killer cells), B cells and antigen-presenting cells (such as dendritic cells and 
macrophages) expressing a polypeptide provided herein. 1^ cell receptors and antibody 
receptors specific for the polypeptides recited herein may be cloned, expressed and 

0 transferred into other vectors or effector cells for adoptive immunotherapy. The 
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polypeptides provided herein may also be used to generate antibodies or anti-idiotypic 
antibodies (as described above and in U.S. Patent No. 4,918,164) for passive 
immunotherapy. 

Effector cells may generally be obtained in sufficient quantities for 
5 adoptive immunotherapy by growth w vitro, as described herein. Culture conditions for 
expanding single antigen-specific effector cells to several billion in number with 
retention of antigen recognition in vivo are well known in the art. Such in vitro culture 
conditions typically use intermittent stimulation with antigen, often in the presence of 
cytokines (such as IL-2) and non-dividing feeder cells. As noted above, 

10 immunoreactive polypeptides as provided herein may be used to rapidly expand 
antigen-specific T cell cultures in order to generate a sufficient number of cells for 
immunotherapy. In particular, antigen-presenting cells, such as dendritic, macrophage, 
monocyte, fibroblast and/or B cells, may be pulsed with immunoreactive polypeptides 
or transfected with one or more polynucleotides using standard techniques well known 

15 in the art. For example, antigen-presenting cells can be transfected with a 
polynucleotide having a promoter appropriate for increasing expression in a 
recombinant virus or other expression system. Cultured effector cells for use in therapy 
must be able to grow and distribute widely, and to survive long term m vivo. Studies 
have shown that cultured effector cells can be induced to grow in vivo and to survive 

20 long term in substantial numbers by repeated stimulation with antigen supplemented 
with IL-2 (see, for example, Cheever et ah. Immunological Reviews 157:111, 1997). 

Alternatively, a vector expressing a polypeptide recited herein may be 
introduced into antigen presenting cells taken from a patient and clonally propagated ex 
vivo for transplant back into the same patient. Transfected cells may be reintroduced 

25 into the patient using any means known in the art, preferably in sterile form by 
intravenous, intracavitary, intraperitoneal or intratumor administration. 

Routes and frequency of administration of the therapeutic compositions 
described herein, as well as dosage, will vary fi*om individual to individual, and may be 
readily established using standard techniques. In general, the pharmaceutical 

30 compositions and vaccines may be administered by injection {e.g., intracutaneous, 
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intramuscular, intravenous or subcutaneous), intranasally (e.g., by aspiration) or orally. 
Preferably, between 1 and 10 doses may be administered over a 52 week period. 
Preferably, 6 doses are administered, at inter\'als of 1 month, and booster vaccinations 
may be given periodically thereafter. Alternate protocols may be appropriate for 
5 individual patients. A suitable dose is an amount of a compound that, when 
administered as described above, is capable of promoting an anti-tumor immune 
response, and is at least 10-50% above the basal (i.e., untreated) level. Such response 
can be monitored by measuring the anti-tumor antibodies in a patient or by vaccine- 
dependent generation of cytolytic effector cells capable of killing the patient's tumor 

10 cells in vitro. Such vaccines should also be capable of causing an immune response that 
leads to an improved clinical outcome (e,g., more frequent remissions, complete or 
partial or longer disease-free survival) in vaccinated patients as compared to non- 
vaccinated patients. In general, for pharmaceutical compositions and vaccines 
comprising one or more polypeptides, the amount of each polypeptide present in a dose 

15 ranges from about 25 pg to 5 mg per kg of host. Suitable dose sizes will vary with the 
size of the patient, but will typically range from about 0.1 mL to about 5 mL. 

In general, an appropriate dosage and treatment regimen provides the 
active compound(s) in an amount sufficient to provide therapeutic and/or prophylactic 
benefit. Such a response can be monitored by establishing an improved clinical 

20 outcome (e.g., more frequent remissions, complete or partial, or longer disease-free 
survival) in treated patients as compared to non-treated patients. Increases in 
preexisting immune responses to a tumor protein generally correlate with an improved 
clinical outcome. Such immune responses may generally be evaluated using standard 
proliferation, cytotoxicity or cytokine assays, which may be performed using samples 

25 obtained from a patient before and after treatment. 

C ancer Detection and Diagnostic Comp ositi ons, Methods and Kits 

In general, a cancer may be detected in a patient based on the presence of 
one or more prostate tumor proteins and/or polynucleotides encoding such proteins in a 
biological sample (for example, blood, sera, sputum urine and/or tumor biopsies) 
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obtained from the patient. In other words, such proteins may be used as markers to 
indicate the presence or absence of a cancer such as prostate cancer. In addition, such 
proteins may be useful for the detection of other cancers. The binding agents provided 
herein generally permit detection of the level of antigen that binds to the agent in the 
5 biological sample. Polynucleotide primers and probes may be used to detect the level of 
mRNA encoding a tumor protein, which is also indicative of the presence or absence of 
a cancer. In general, a prostate tumor sequence should be present at a level that is at 
least thi*ee fold higher in tumor tissue than in normal tissue 

There are a variety of assay formats known to those of ordinary skill in 

10 the art for using a binding agent to detect polypeptide markers in a sample. See^ e.g., 
Harlow and Lane, Antibodies: A Laboratoiy Manual, Cold Spring Harbor Laboratory, 
1988. In general, the presence or absence of a cancer in a patient may be determined by 
(a) contacting a biological sample obtained from a patient with a binding agent; (b) 
detecting in the sample a level of polypeptide that binds to the binding agent; and (c) 

15 comparing the level of polypeptide with a predetermined cut-off value. 

In a preferred embodiment, the assay involves the use of binding agent 
immobilized on a solid support to bind to and remove the polypeptide from the 
remainder of the sample. The bound polypeptide may then be detected using a detection 
reagent that contains a reporter group and specifically binds to the binding 

20 agent/polypeptide complex. Such detection reagents may comprise, for example, a 
binding agent that specifically binds to the polypeptide or an antibody or other agent 
that specifically binds to the binding agent, such as an anti-immunoglobulin, protein G, 
protein A or a lectin. Alternatively, a competitive assay may be utilized, in which a 
polypeptide is labeled with a reporter group and allowed to bind to the immobilized 

25 binding agent after incubation of the binding agent with the sample. The extent to 
which components of the sample inhibit the binding of the labeled polypeptide to the 
binding agent is indicative of the reactivity of the sample with the immobilized binding 
agent. Suitable polypeptides for use within such assays include full length prostate 
tumor proteins and polypeptide portions thereof to which the binding agent binds, as 

30 described above. 
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1 he solid support may be any material known to those of ordinar\- skill 
111 the art to which the tumor protein may be attached. For example, the solid support 
may be a test well in a microtiter plate or a nitrocellulose or other suitable membrane. 
Altemati\ely, the support may be a bead or disc, such as glass, fiberglass, latex or a 

5 plastic material such as polyst>Tene or polyvinylchloride. The support may also be a 
magnetic particle or a fiber optic sensor, such as those disclosed, for example, in U.S. 
Patent No. 5,359,681. The binding agent may be immobilized on the solid support 
using a variety of tecliniques known to those of skill in the art, which aie amply 
described in the patent and scientific literature. In the context of the present invention, 

0 the term "immobilization" refers to both noncovalent association, such as adsorption, 
and co\alcnl attachment (which may be a direct linkage between the agent and 
functional grt>ups on the support or may be a linkage by way of a cross-linking agent). 
Immohili/^ition by adsorption to a well in a microtiter plate or to a membrane is 
preferred, in such cases, adsorption may be achieved by contacting the binding agent, in 

5 a suitable buffer, with the solid support for a suitable amount of time. The contact time 
varies \vith temperature, but is typically between about 1 hour and about 1 day. In 
genera], contacting a well of a plastic microtiter plate (such as polystyrene or 
polyvinylchloride) with an amount of binding agent ranging from about lOng to about 
10|^g, and preferably about 100 ng to about 1 pg, is sufficient to immobilize an 

0 ;idequate amount of binding agent. 

Covalent attachment of binding agent to a solid support may generally be 
achieved by first reacting the support with a bifunctional reagent that will react with 
both the support and a functional group, such as a hydroxyl or amino group, on the 
binding agent. For example, the binding agent may be covalently attached to supports 
5 having an appropriate polymer coating using benzoquinone or by condensation of an 
aldehyde group on the support with an amine and an active hydrogen on the binding 
partner (see. e.g.. Pierce Immunotechnology Catalog and Handbook, 1991, at 
A12-A13). 

In certain embodiments, the assay is a two-antibody sandwich assay. 
0 I his assa\' may be performed by first contacting an antibody tliat has been immobilized 
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on a solid support, commonly the well of a microtiter plate, with the sample, such that 

polypeptides within the sample are allowed to bind to the immobilized antibody. 

Unbound sample is then removed from the inmiobilized polypeptide-antibody 

complexes and a detection reagent (preferably a second antibody capable of binding to a 
5 different site on the polypeptide) containing a reporter group is added. The amoimt of 

detection reagent that remains bound to the solid support is then determined using a 

method appropriate for the specific reporter group. 

More specifically, once the antibody is immobilized on the support as 

described above, the remaining protein binding sites on the support are typically 
1 0 blocked. Any suitable blocking agent known to those of ordinary skill in the art, such as 

bovine serum albumin or Tween 20™ (Sigma Chemical Co,, St. Louis, MO). The 

immobilized antibody is then incubated with the sample, and polypeptide is allowed to 

bind to the antibody. The sample may be diluted with a suitable diluent, such as 

phosphate-buffered saline (PBS) prior to incubation. In general, an appropriate contact 
15 time (i.e., incubation time) is a period of time that is sufficient to detect the presence of 

polypeptide within a sample obtained from an individual with prostate cancer. 

Preferably, the contact time is sufficient to achieve a level of binding that is at least 

about 95% of that achieved at equilibrium between bound and unbound polypeptide. 

Those of ordinary skill in the art will recognize that the time necessary to achieve 
20 equilibrium may be readily determined by assaying the level of binding that occurs over 

a period of time. At room temperature, an incubation time of about 30 minutes is 

generally sufficient. 

Unbound sample may then be removed by washing the solid support 

with an appropriate buffer, such as PBS containing 0.1% Tween 20™. The second 
25 antibody, which contains a reporter group, may then be added to the solid support. 

Preferred reporter groups include those groups recited above. 

The detection reagent is then incubated with the immobilized antibody- 

polypeptide complex for an amount of time sufficient to detect the bound polypeptide. 

An appropriate amount of time may generally be determined by assaying the level of 
30 binding that occurs over a period of time. Unbound detection reagent is then removed 
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and hound detection reagent is detected using the reporter group. The method employed 
lor detecting the reporter group depends upon the nature of the reporter group. For 
radioactive groups, scintillation counting or autoradiographic methods are generally 
appropriate. Spectroscopic methods may be used to detect dyes, luminescent groups 
5 and fluorescent groups. Biotin may be detected using avidin, coupled to a different 
reporter group (commonly a radioactive or fluorescent group or an enzyme). EnzyTOe 
reporter groups may generally be detected by the addition of substrate (generally for a 
specific period of time), followed by spectroscopic or other analysis of the reaction 
products. 

10 To determine the presence or absence of a cancer, such as prostate 

cancer, the signal detected from the reporter group that remains bound to the solid 
support is generally compared to a signal that corresponds to a predetennined cut-off 
value. In one preferred embodiment, the cut-off value for the detection of a cancer is 
the average mean signal obtained when the immobilized antibody is incubated with 

1 5 samples from patients without the cancer. In general, a sample generating a signal that 
is three standard deviations above the predetemiined cut-off value is considered positive 
for the cancer. In an alternate preferred embodiment, the cut-off value is determined 
using a Receiver Operator Curve, according to the method of Sackett et ah. Clinical 
Epidemiology: A Basic Science for Clinical Medicine, Little Brown and Co., 1985, 

20 p. 106-7. Briefly, in this embodiment, the cut-off value may be determined from a plot 
of pairs of true positive rates (i.e., sensitivity^) and false positive rates (100%-specificity) 
that coirespond to each possible cut-off value for the diagnostic test result. The cut-off 
^alue on the plot that is the closest to the upper left-hand comer (i.e,, the value that 
encloses the largest area) is the most accurate cut-off value, and a sample generating a 

25 signal that is higher than the cut-off value detemiined by this method may be considered 
positive. Alternatively, the cut-off value may be shifted to the left along the plot, to 
minimize the false positive rate, or to the right, to minimize the false negative rate. In 
general, a sample generating a signal that is higher than the cut-off value determined by 
this method is considered positive for a cancer. 
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In a related embodiment, the assay is performed in a flow-through or 
strip test format, wherein the binding agent is immobilized on a membrane, such as 
nitrocellulose. In the flow-through test, polypeptides within the sample bind to the 
immobilized binding agent as the sample passes through the membrane. A second, 
5 labeled binding agent then binds to the binding agent-polypeptide complex as a solution 
containing the second binding agent flows through the membrane. The detection of 
bound second binding agent may then be performed as described above. In the strip test 
fomiat, one end of the membrane to which binding agent is bound is immersed in a 
solution containing the sample. The sample migrates along the membrane through a 

10 region containing second binding agent and to the area of immobilized binding agent. 
Concentration of second binding agent at the area of immobilized antibody indicates the 
presence of a cancer. Typically, the concentration of second binding agent at that site 
generates a pattern, such as a line, that can be read visually. The absence of such a 
pattern indicates a negative result. In general, the amount of binding agent immobilized 

15 on the membrane is selected to generate a visually discernible pattern when the 
biological sample contains a level of polypeptide that would be sufficient to generate a 
positive signal in the two-antibody sandwich assay, in the format discussed above. 
Preferred binding agents for use in such assays are antibodies and antigen-binding 
fi-agments thereof. Preferably, the amount of antibody immobilized on the membrane 

20 ranges from about 25 ng to about l|ag, and more preferably from about 50ng to about 
500 ng. Such tests can typically be performed with a very small amount of biological 
sample. 

Of course, numerous other assay protocols exist that are suitable for use 
wath the tumor proteins or binding agents of the present invention. The above 
25 descriptions are intended to be exemplary only. For example, it will be apparent to 
those of ordinary skill in the art that the above protocols may be readily modified to use 
tumor polypeptides to detect antibodies that bind to such polypeptides in a biological 
sample. The detection of such tumor protein specific antibodies may correlate with the 
presence of a cancer. 
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A cancer may also, or alternatively, be detected based on the presence of 
1 cells that specifically react with a tumor protein in a biological sample. Within 
certain methods, a biological sample comprising CD4^ and/or CDS"*' T cells isolated 
from a patient is incubated with a tumor polypeptide, a polynucleotide encoding such a 
5 polypeptide and/or an APC that expresses at least an immunogenic portion of such a 
polypeptide, and the presence or absence of specific activation of the T cells is detected. 
Suitable biological san:iples include, but are not limited to, isolated T cells. For 
example, T cells may be isolated from a patient by routine techniques (such as by 
Ficoll/Hypaque density gradient centrifugation of peripheral blood lymphocytes). T 

10 cells may be incubated in vitro for 2-9 days (typically 4 days) at 3TC \v\\h polypeptide 
{e.g., 5-25 )^g/ml). It may be desirable to incubate another aliquot of a Tcell sample in 
the absence of tumor polypeptide to serve as a control. For CD4^ T cells, activation is 
preferably detected by evaluating proliferation of the T cells. For CD8^ T cells, 
activation is preferably detected by evaluating cytolytic activity. A level of proliferation 

15 that is at least two fold greater and/or a level of cytolytic activity that is at least 20% 
greater than in disease-free patients indicates the presence of a cancer in the patient. 

As noted above, a cancer may also, or alternatively, be detected based on 
the level of mRNA encoding a tumor protein in a biological sample. For example, at 
least two oligonucleotide primers may be employed in a polymerase chain reaction 

20 (PGR) based assay to amplify a portion of a tumor cDNA derived from a biological 
sample, wherein at least one of the oligonucleotide primers is specific for (i.e., 
hybridizes to) a polynucleotide encoding the tumor protein. The amplified cDNA is 
then separated and detected using techniques well knowTi in the art, such as gel 
electrophoresis. Similarly, oligonucleotide probes that specifically hybridize to a 

25 polynucleotide encoding a tumor protein may be used in a hybridization assay to detect 
the presence of polynucleotide encoding the tumor protein in a biological sample. 

To permit hybridization under assay conditions, oligonucleotide primers 
and probes should comprise an oligonucleotide sequence that has at least about 60%, 
preferably at least about 75% and more preferably at least about 90%, identity to a 

30 portion of a polynucleotide encoding a tumor protein of the invention that is at least 10 
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nucleotides, and preferably at least 20 nucleotides, in length. Preferably, 
oligonucleotide primers and/or probes hybridize to a polynucleotide encoding a 
polypeptide described herein under moderately stringent conditions, as defined above. 
Oligonucleotide primers and/or probes which may be usefully employed in the 
5 diagnostic methods described herein preferably are at least 10-40 nucleotides in length. 
In a preferred embodiment, the oligonucleotide primers comprise at least 10 contiguous 
nucleotides, more preferably at least 15 contiguous nucleotides, of a DNA molecule 
having a sequence as disclosed herein. Techniques for both PGR based assays and 
hybridization assays are well known in the art (see, for example, Mullis et al.. Cold 
10 Spring Harbor Symp. Quant, Biol, 57:263, 1987; Eriich ed., PCR Technology, Stockton 
Press, NY, 1989). 

One preferred assay employs RT-PCR, in which PCR is applied in 
conjunction with reverse transcription. Typically, RNA is extracted from a biological 
sample, such as biopsy tissue, and is reverse transcribed to produce cDNA molecules. 

1 5 PCR amplification using at least one specific primer generates a cDNA molecule, which 
may be separated and visualized using, for example, gel electrophoresis. Amplification 
may be performed on biological samples taken fi-om a test patient and from an 
individual who is not afflicted with a cancer. The amplification reaction may be 
performed on several dilutions of cDNA spanning two orders of magnitude. A two-fold 

20 or greater increase in expression in several dilutions of the test patient sample as 
compared to the same dilutions of the non-cancerous sample is typically considered 
positive. 

In another embodiment, the compositions described herein may be used 
as markers for the progression of cancer. In this embodiment, assays as described above 
25 for the diagnosis of a cancer may be performed over time, and the change in the level of 
reactive polypeptide(s) or polynucleotide(s) evaluated. For example, the assays may be 
performed every 24-72 hours for a period of 6 months to 1 year, and thereafter 
performed as needed. In general, a cancer is progressing in those patients in whom the 
level of polypeptide or polynucleotide detected increases over time. In contrast, the 
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cancer is not progressing when the level of reactive polypeptide or polynucleotide either 
remains constant or decreases with time. 

Certain /;? vivo diagnostic assays may be performed directly on a tumor. 
One such assay involves contacting tumor cells with a binding agent. The bound 
5 binding agent may then be detected directly or indirectly via a reporter group. Such 
binding agents may also be used in histological applications. Alternatively, 
polynucleotide probes may be used w'ithin such applications. 

As noted above, to improve sensitivity', multiple tumor protein markers 
may be assayed within a given sample. It will be apparent that binding agents specific 
1 0 for different proteins provided herein may be combined within a single assay. Further, 
multiple primers or probes may be used concurrently. The selection of tumor protein 
markers may be based on routine experiments to determine combinations that results in 
optimal sensitivity. In addition, or alternatively, assays for tumor proteins provided 
herein may be combined with assays for other known tumor antigens. 
13 The present invention further provides kits for use within any of the 

above diagnostic methods. Such kits typically comprise two or more components 
necessary for performing a diagnostic assay. Components may be compounds, reagents, 
containers and/or equipment. For example, one container within a kit may contain a 
monoclonal antibody or fragment thereof that specifically binds to a tumor protein. 
20 Such antibodies or fragments may be provided attached to a support material, as 
described above. One or more additional containers may enclose elements, such as 
reagents or buffers, to be used in the assay. Such kits may also, or alternatively, contain 
a detection reagent as described above that contains a reporter group suitable for direct 
or mdirect detection of antibody binding. 
25 Alternatively, a kit may be designed to detect the level of mRNA 

encoding a tumor protein in a biological sample. Such kits generally comprise at least 
one oligonucleotide probe or primer, as described above, that hybridizes to a 
polynucleotide encoding a tumor protein. Such an oligonucleotide may be used, for 
example, within a PCR or hybridization assay. Additional components that may be 
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present within such kits include a second oligonucleotide and/or a diagnostic reagent or 
container to facilitate the detection of a polynucleotide encoding a tumor protein. 

The following Examples are offered by way of illustration and not by 
way of limitation. 

5 

EXAMPLES 



EXAMPLE 1 

Isolation and Characterization of Prostate-specific Polypeptides 



10 



This Example describes the isolation of certain prostate-specific 
polypeptides from a prostate tiunor cDNA library. 

A human prostate tumor cDNA expression library was constructed from 
prostate tumor poly A*^ RNA using a Superscript Plasmid System for cDNA Synthesis 

15 and Plasmid Cloning kit (BRL Life Teclinologies, Gaithersburg, MD 20897) following 
the manufacturer's protocol. Specifically, prostate tumor tissues were homogenized 
with polytron (Kinematica, Switzerland) and total RNA was extracted using Trizol 
reagent (BRL Life Technologies) as directed by the manufacturer. The poly A^ RNA 
was then purified using a Qiagen oligotex spin column mRNA purification kit (Qiagen, 

20 Santa Clarita, CA 91355) according to the manufacturer's protocol. First-strand cDNA 
was synthesized using the NotI/01igo-dT18 primer. Double-stranded cDNA was 
synthesized, ligated with EcoRI/BAXI adaptors (Invitrogen, San Diego, CA) and 
digested with Notl. Following size fractionation with Chroma Spin-1000 columns 
(Clontech, Palo Alto, CA), the cDNA was ligated into the EcoRI/NotI site of 

25 pCDNAS.l (Invitrogen) and transformed into ElectroMax E, coli DHIOB cells (BRL 
Life Technologies) by electroporation. 

Using the same procedure, a normal human pancreas cDNA expression 
library was prepared from a pool of six tissue specimens (Clontech). The cDNA 
libraries were characterized by determining the number of independent colonies, the 

30 percentage of clones that carried insert, the average insert size and by sequence analysis. 
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The prostate tumor librar}' contained 1.64 x 10^ independent colonies, with 10% of 
clones having an insert and the average insert size being 1 745 base pairs. The normal 
pancreas cDNA librar\^ contained 3.3 x 10^ independent colonies, with 69% of clones 
having inserts and the average insert size being 1120 base pairs. For both libraries, 
5 sequence analysis showed that the majorit}^ of clones had a full length cDNA sequence 
and were synthesized from mRNA, with minimal rRNA and mitochondrial DNA 
contamination. 

cDNA library subtraction was performed using the above prostate tumor 
and normal pancreas cDNA libraries, as described by Hara et al. (Blood, 189-1 99, 

10 1994) with some modifications. Specifically, a prostate tumor-specific subtracted 
cDNA librar>^ was generated as follows. Normal pancreas cDNA library (70 \xg) was 
digested with EcoRl, NotI, and SfuT, followed by a filling-in reaction with DNA 
polymerase Klenow fragment. After phenol -chloroform extraction and ethanol 
precipitation, the DNA was dissolved in 100 /.il of H2O, heat-denatured and mixed with 

15 100 |al (100 i^g) of Photoprobe biotin (Vector Laboratories, Burlingame, CA). As 
reconmnended by the manufacturer, the resulting mixture was irradiated with a 270 W 
sunlamp on ice for 20 minutes. Additional Photoprobe biotin (50 jal) was added and 
the biotinylation reaction was repeated. After extraction with butanol five times, the 
DNA was ethanol-precipitated and dissolved in 23 \i\ H2O to form the driver DNA. 

20 To form the tracer DNA, 10 \xg prostate tumor cDNA library was 

digested with BamHI and Xhol, phenol chloroform extracted and passed through 
Chroma spin-400 columns (Clontech). Following ethanol precipitation, the tracer DNA 
was dissolved in 5 \\\ H2O. Tracer DNA was mixed with 1 5 pi driver DNA and 20 \x\ of 
2 X hybridization buffer (1.5 M NaCl/10 mM EDTA/50 mM HEPES pH 7.5/0.2% 

25 sodium dodecyl sulfate), overlaid with mineral oil, and heat-denatured completely. The 
sample was immediately transferred into a 68 water bath and incubated for 20 hours 
(long hybridization [LH]). The reaction mixture was then subjected to a streptavidin 
treatment followed by phenol/chloroform extraction. This process was repeated three 
more times. Subtracted DNA was precipitated, dissolved in 12 |.il H2O, mixed with 8 pi 

30 driver DNA and 20 ^1 of 2 x hybridization buffer, and subjected to a hybridization at 68 
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for 2 hours (short hybridization [SH]). After removal of biotinylated double- 
stranded DN A, subtracted cDNA was ligated into BaraHI/XhoI site of chloramphenicol 
resistant pBCSK"^ (Stratagene, La Jolla, CA 92037) and transformed into ElectroMax E, 
coU DJIlOB cells by electroporation to generate a prostate tumor specific subtracted 
5 cDNA library (referred to as "prostate subtraction T'). 

To analyze the subtracted cDNA library, plasmid DNA was prepared 
from 100 independent clones, randomly picked from the subtracted prostate tumor 
specific library and grouped based on insert size. Representative cDNA clones were 
further characterized by DNA sequencing with a Perkin Elmer/ Applied Biosy stems 
10 Division Automated Sequencer Model 373 A (Foster City, CA). Six cDNA clones, 
hereinafter referred to as Fl-13, Fl-12, Fl-16, Hl-1, Hl-9 and Hl-4, were shown to be 
abundant in the subtracted prostate-specific cDNA library. The determined 3' and 5' 
cDNA sequences for Fl-12 are provided in SEQ ID NO: 2 and 3, respectively, with 
determined 3' cDNA sequences for F1-I3, FM6, HI-1, Hl-9 and Hl-4 being provided 
1 5 in SEQ ID NO: 1 and 4-7, respectively. 

The cDNA sequences for the isolated clones were compared to known 
sequences in the gene bank using the EMBL and GenBank databases (release 96). Four 
of the prostate tumor cDNA clones, Fl-13, Fl-16, Hl-1, and Hl-4, were deteraiined to 
encode the following previously identified proteins: prostate specific antigen (PSA), 
20 human glandular kallikrein, human tumor expression enhanced gene, and mitochondria 
cytochrome C oxidase subunit 11. Hl-9 was found to be identical to a previously 
identified human autonomously replicating sequence. No significant homologies to the 
cDNA sequence for Fl-12 were found. 

Subsequent studies led to the isolation of a full-length cDNA sequence 
25 for Fl-12 (also referred to as P504S). This sequence is provided in SEQ ID NO: 107, 
with the corresponding predicted amino acid sequence being provided in SEQ ID NO: 
108. cDNA splice variants of P504S are provided in SEQ ID NO: 600-605. 

To clone less abundant prostate tumor specific genes, cDNA library 
subtraction was performed by subtracting the prostate tumor cDNA library described 
30 above with the normal pancreas cDNA library and with the three most abundant genes 
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in the previously subtracted prostate tumor specific cDNA librar>': human glandular 
kalJikrein, prostate specific antigen (PSA), and mitochondria cytochrome C oxidase 
subunit 11. Specifically. 1 /ag each of human glandular kallikrein, PSA and 
mitochondria cytochrome C oxidase subunit II cDNAs in pCDNA3.1 were added lo the 

5 driver DNA and subtraction was performed as described above to provide a second 
subtracted cDNA library hereinafter referred to as the "subtracted prostate tumor 
specific cDNA library with spike". 

Twentv^-two cDNA clones were isolated from the subtracted prostate 
tumor specific cDNA librar>^ with spike. The detennined 3' and 5' cDNA sequences for 

0 the clones referred to as Jl-17, Ll-12, Nl-1862, Jl-13, JI-19, Jl-25, Jl-24, Kl-58, Kl- 
63, Ll-4 and Ll-14 are provided in SEQ IDNOS: 8-9, 10-1 L 12-13, 14-15, 16-17, 18- 
19, 20-21, 22-23, 24-25, 26-27 and 28-29, respectively. The determined 3' cDNA 
sequences for the clones referred to as Jl-12, Jl-16, Jl-21, Kl-48, Kl-55, Ll-2, Ll-6, 
N1-I858, NM860, Nl-1861, Nl-1864 are provided in SEQ ID NOS: 30-40, 

5 respectively. Comparison of these sequences w^ith those in the gene bank as described 
above, revealed no significant homologies to three of the five most abundant DNA 
species, (Jl-17, Ll-12 and Nl-1862; SEQ IDNOS: 8-9, 10-11 and 12-13, respectively). 
Of the remaining two most abundant species, one (Jl-12; SEQ ID NO:30) was found to 
be identical to the previously identified human pulmonary surfactant-associated protein, 

0 and the other (Kl-48; SEQ ID NO:33) was determined to have some homology to K 
norvegicus mRNA for 2-arylpropionyl-CoA epimerase. Of the 17 less abundant cDNA 
clones isolated from the subtracted prostate tumor specific cDNA library with spike, 
four (JM6, Kl-55, Ll-6 and Nl-1864; SEQ ID N0S:31, 34, 36 and 40, respecfively) 
were found to be identical to previously identified sequences, two (Jl-21 and Nl-1860; 

5 SEQ ID NOS: 32 and 38, respectively) were found to show some homology to non- 
human sequences, and two (Ll-2 and Nl-1861; SEQ ID NOS: 35 and 39, respectively) 
were found to show some homology to known human sequences. No significant 
homologies were found to the polypeptides Jl-13, Jl-19, Jl-24, Jl-25, Kl-58, Kl-63, 
Ll-4, Ll-14 (SEQ ID NOS: 14-15, 16-17, 20-21, 18-19, 22-23, 24-25, 26-27, 28-29, 

0 respectively). 
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Subsequent studies led to the isolation of full length cDNA sequences for 
Jl-17, Ll-12 and Nl-1862 (SEQ ID NOS: 109-111, respectively). The corresponding 
predicted amino acid sequences are provided in SEQ ID NOS: 112-1 14. Ll-12 is also 
referred to as P501S. A cDNA splice variant of P501S is provided in SEQ ID NO: 606. 
5 In a further experiment, four additional clones ^vere identified by 

subtracting a prostate tumor cDNA library with normal prostate cDNA prepared from a 
pool of three normal prostate poly A+ RN A (refen*ed to as "prostate subtraction 2"). 
The determined cDNA sequences for these clones, hereinafter referred to as U 1-3 064, 
Ul-3065, Vl-3692 and lA-3905, are provided in SEQ ID NO: 69-72, respectively. 

10 Comparison of the determined sequences with those in the gene bank revealed no 
significant homologies to U 1-3065. 

A second subtraction with spike (referred to as "prostate subtraction 
spike 2") was performed by subtracting a prostate tumor specific cDNA library with 
spike with nonnal pancreas cDNA library and further spiked with PSA, Jl-17, 

15 pulmonary surfactant-associated protein, mitochondrial DNA, cytochrome c oxidase 
subunit II, Nl-1862, autonomously replicating sequence, Ll-12 and tumor expression 
enhanced gene. Four additional clones, hereinafter referred to as VI -3686, Rl-2330, 
lB-3976 and VI -3679, were isolated. The determined cDNA sequences for these 
clones are provided in SEQ ID NO:73-76, respectively. Comparison of these sequences 

20 with those in the gene bank revealed no significant homologies to VI -3 686 and Rl- 
2330. 

Further analysis of the three prostate subtractions described above 
(prostate subtraction 2, subtracted prostate tumor specific cDNA library with spike, and 
prostate subtraction spike 2) resulted in the identification of sixteen additional clones, 

25 referred to as lG-4736, lG-4738, lG-4741, lG-4744, lG-4734, lH-4774, lH-4781, 
lH-4785, lH-4787, lH-4796, 11-4810, 11-4811, lJ-4876, lK-4884 and lK-4896. The 
determined cDNA sequences for these clones are provided in SEQ ID NOS: 77-92, 
respectively. Comparison of these sequences with those in the gene bank as described 
above, revealed no significant homologies to lG-4741, lG-4734, 11-4807, lJ-4876 and 

30 lK-4896 (SEQ ID NOS: 79, 81, 87, 90 and 92, respectively). Further analysis of the 
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isv:)lated clones led to the determination of extended cDNA sequences for lG-4736, IG- 
4738, lG-4741. lG-4744, lH-4774, lH-4781, lH-4785, lH-4787, lH-4796, 11-4807, 
lJ-4876, lK-4884 imd lK-4896, provided in SEQ ID NOS: 179-188 and 191-193, 
respectively, and to the detemiination of additional partial cDNA sequences for 11-4810 
5 and 11-481 1 , provided in SEQ ID NOS: 189 and 190, respectively. 

Additional studies with prostate subtraction spike 2 resulted in the 
isolation ol^ three more clones. Their sequences were determined as described above 
:ind compvircd to the most recent GenBank. All three clones were found to have 
honn)logy to known genes, which aie Cysteine-rich protein, KIAA0242, and KIAA0280 

10 (SEQ ID NO: 317. 319, and 320, respectively). Further analysis of these clones by 
Synteni microarray {Synteni, Palo Alto, CA) demonstrated that all three clones w^ere 
over-expressed in mt)st prostate tumors and prostate BPH, as well as in the majority of 
normal prosiaie tissues tested, but low expression in all other nomial tissues. 

An additional subtraction was performed by subtracting a normal 

15 prostate cDNA library with normal pancreas cDNA (referred to as ''prostate subtraction 
3")* This led to the identification of six additional clones referred to as lG-4761, IG- 
4762, lH-4766, lH-4770, lH-4771 and lH-4772 (SEQ ID NOS: 93-98). Comparison 
of these sequences with those in the gene bank revealed no significant homologies to 
lG-4761 and lH-4771 (SEQ ID NOS: 93 and 97, respectively). Further analysis of the 

20 isolated clones led to the determination of extended cDNA sequences for lG-4761, IG- 
4762, lH-4766 and lH-4772 provided in SEQ ID NOS: 194-196 and 199, respectively, 
and to the detemnnation of additional partial cDNA sequences for lH-4770 and IH- 
4771, provided in SEQ ID NOS: 197 and 198, respectively. 

Subtraction of a prostate tumor cDNA library, prepared from a pool of 

25 polyAH- RNA from three prostate cancer patients, with a normal pancreas cDNA library 
(prostate subtraction 4) led to tlie identification of eight clones, referred to as lD-4297, 
lD-4309, lD.1-4278, lD-4288, lD-4283, lD-4304, lD-4296 and lD-4280 (SEQ ID 
NOS: 99-107). I hese sequences w^ere compared to those in the gene bank as described 
above. No significant homologies were found to lD-4283 and lD-4304 (SEQ ID NOS: 

■^0 103 and 104, respectively). Further analysis of the isolated clones led to the 
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determination of extended cDNA sequences for lD-4309, lD.1-4278, lD-4288, ID- 
4283, lD-4304, lD-4296 and lD-4280, provided in SEQ ID NOS: 200-206, 
respectively. 

cDNA clones isolated in prostate subtraction 1 and prostate subtraction 
5 2, described above, were colony PGR amplified and their mRNA expression levels in 
prostate tumor, normal prostate and in various other normal tissues were determined 
using microarray technology (Synteni, Palo Alto, CA). Briefly, the PGR amplification 
products were dotted onto slides in an array format, with each product occupying a 
unique location in the array. mRNA was extracted from the tissue sample to be tested, 

10 reverse transcribed, and fluorescent-labeled cDNA probes were generated. The 
microarrays were probed with the labeled cDNA probes, the slides scanned and 
fluorescence intensity was measured. This intensity correlates with the hybridization 
intensity. Two clones (referred to as P509S and P510S) were found to be over- 
expressed in prostate tumor and normal prostate and expressed at low levels in all other 

1 5 normal tissues tested (liver, pancreas, skin, bone marrow, brain, breast, adrenal gland, 
bladder, testes, salivary gland, large intestine, kidney, ovary, lung, spinal cord, skeletal 
muscle and colon). The determined cDNA sequences for P509S and P510S are 
provided in SEQ ID NO: 223 and 224, respectively. Comparison of these sequences 
with those in the gene bank as described above, revealed some homology to previously 

20 identified ESTs. 

Additional, studies led to the isolation of the fall-length cDNA sequence 
for P509S. This sequence is provided in SEQ ID NO: 332, with the corresponding 
predicted amino acid sequence being provided in SEQ ID NO: 339. Two variant fall- 
length cDNA sequences for P510S are provided in SEQ ID NO: 535 and 536, with the 

25 corresponding predicted amino acid sequences being provided in SEQ ID NO: 537 and 
538, respectively. Additional spHce variants of P510S are provided in SEQ ID NO: 598 
and 599. 
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EXAMPLE 2 

Determination of Tissue Specificity of Prostate-specific Polypeptides 

Using gene specific primers, mRNA expression levels for the 
5 representative prostate-specific polypeptides El -16, Hl-1, J 1-1 7 (also refeiTcd to as 
P502SX Ll-12 (also refened to as P501S), FM2 (also referred to as P504S) and Nl- 
1862 (also referred to as P503S) were examined in a variety of normal and tumor tissues 
using RT-PCR. 

Briefly, total RNA was extracted from a variety of nomial and tumor 

10 tissues using Trizol reagent as described above. First strand synthesis was carried out 
using 1-2 /Lig of total RNA with Superscript II reverse transcriptase (BRL Life 
f echnologies) at 42 for one hour. The cDNA was then amplified by PGR with genc- 
specific primers. To ensure the semi-quantitative nature of the RT-PCR, p-actin was 
used as an internal control for each of the tissues examined. First, serial dilutions of the 

15 first strand cDNAs were prepared and RT-PCR assays were performed using p-actin 
specific primers. A dilution was then chosen that enabled the linear range amplification 
of the P-actin template and which was sensitive enough to reflect the differences in the 
mitial copy numbers. Using these conditions, the P-actin levels w^ere determined for 
each reverse transcription reaction from each tissue. DNA contamination was 

20 minimized by DNase treatment and by assuring a negative PCR result when using first 
strand cDNA that was prepared without adding reverse transcriptase. 

mRNA Expression levels were examined in four different types of tumor 
tissue (prostate tumor from 2 patients, breast tumor from 3 patients, colon tumor, lung 
tumor), and sixteen different normal tissues, including prostate, colon, kidney, liver, 

25 lung, ovary, pancreas, skeletal muscle, skin, stomach, testes, bone marrow and brain. 
Fl-16 was found to be expressed at high levels in prostate tumor tissue, colon tumor 
and normal prostate, and at low^er levels in normal liver, skin and testes, with expression 
being undetectable in the other tissues examined. Hl-1 was found to be expressed at 
high levels in prostate tumor, lung tumor, breast tumor, normal prostate, normal colon 

^0 and normal brain, at much lower levels in normal lung, pancreas, skeletal muscle, skin. 
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small intestine, bone maiTow, and was not detected in the other tissues tested. Jl-17 
(P502S) and Ll-12 (P501S) appear to be specifically over-expressed in prostate, with 
both genes being expressed at high levels in prostate tumor and normal prostate but at 
low to undetectable levels in all the other tissues examined. Nl-1862 (P503S) was 
5 found to be over-expressed in 60% of prostate tumors and detectable in normal colon 
and kidney. The RT-PCR results thus indicate that Fl-16, Hl-U Jl-17 (P502SX Nl- 
1862 (P503S) and Ll-12 (P501S) are either prostate specific or are expressed at 
significantly elevated levels in prostate. 

Further RT-PCR studies showed that Fl-12 (P504S) is over-expressed in 

10 60% of prostate tumors, detectable in normal kidney but not detectable in all other 
tissues tested. Similarly, Rl-2330 was shown to be over-expressed in 40% of prostate 
tumors, delectable in normal kidney and liver, but not detectable in all other tissues 
tested. U I -3064 was found to be over-expressed in 60% of prostate tumors, and also 
expressed in breast and colon tumors, but was not detectable in normal tissues. 

15 RT-PCR characterization of Rl-2330, Ul-3064 and lD-4279 showed 

that these three antigens are over-expressed in prostate and/or prostate tumors. 

Northern analysis with four prostate tumors, two normal prostate 
samples, two BPH prostates, and normal colon, kidney, liver, lung, pancrease, skeletal 
muscle, brain, stomach, testes, small intestine and bone marrow, showed that Ll-12 

20 (P501S) is over-expressed in prostate tumors and normal prostate, while being 
undetectable in other normal tissues tested. Jl-17 (P502S) was detected in two prostate 
tumors and not in the other tissues tested. Nl-1862 (P503S) was found to be over- 
expressed in three prostate tumors and to be expressed in noimal prostate, colon and 
kidney, but not in other tissues tested. Fl-12 (P504S) was found to be highly expressed 

25 in two prostate tumors and to be undetectable in all other tissues tested. 

The microarray technology described above was used to determine the 
expression levels of representative antigens described herein in prostate tumor, breast 
timior and the following normal tissues: prostate, liver, pancreas, skin, bone marrow, 
brain, breast, adrenal gland, bladder, testes, salivary gland, large intestine, kidney, 

30 ovary, lung, spinal cord, skeletal muscle and colon. Ll-12 (P501S) was foimd to be 
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over-expressed in nomial prostate and prostate tumon with some expression being 
detected in normal skeletal muscle. Both J 1-12 and Fl-12 (P504S) were found to be 
over-expressed in prostate tumor, with expression being lower or undetectable in all 
other tissues tested. Nl-1862 (P503S) was found to be expressed at high levels in 

5 prostate tumor and nonnal prostate, and at low levels in normal large intestine and 
normal colon, with expression being undetectable in all other tissues tested. Rl-2330 
was found to be over-expressed in prostate tumor and normal prostate, and to be 
expressed at lower levels in all other tissues tested. lD-4279 was found to be over- 
expressed in prostate tumor and nonnal prostate, expressed at lower levels in normal 

0 spinal cord, and to be undetectable in all other tissues tested. 

Further microarray analysis to specifically address the extent to which 
P501S (SEQ ID NO: 110) was expressed in breast tumor revealed moderate over- 
expression not only in breast tumor, but also in metastatic breast tumor (2/31), with 
negligible to low expression in normal tissues. This data suggests that P501S may be 

5 over-expressed in various breast tumors as well as in prostate tumors. 

The expression levels of 32 ESTs (expressed sequence tags) described by 
Vasmatzis et al {Proc. Natl Acad. Sci. USA 95:300-304, 1998) in a variety of tumor 
and normal tissues were examined by microarray technology as described above. Two 
of these clones (referred to as PIOOOC and PI 001 C) were found to be over-expressed in 

(> prostate tumor and normal prostate, and expressed at low to undetectable levels in all 
other tissues tested (normal aorta, thymus, resting and activated PBMC, epithelial cells, 
spinal cord, adrenal gland, fetal tissues, skin, salivary gland, large intestine, bone 
marrow, liver, lung, dendritic cells, stomach, lymph nodes, brain, heart, small intestine, 
skeletal muscle, colon and kidney. The determined cDNA sequences for PIOOOC and 

5 PIOOIC are provided in SEQ ID NO: 384 and 472, respectively. The sequence of 
PI 001 C was found to show some homology to the previously isolated Human mRNA 
for JM27 protein. Subsequent comparison of the sequence of SEQ ID NO: 384 with 
sequences in the public databases, led to the identification of a full-length cDNA 
sequence of PIOOOC (SEQ ID NO: 786), which encodes a 492 amino acid sequence. 
0 Analysis of the amino acid sequence using the PSORT II program led to the 
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identification of a putative transmembrane domain from amino acids 84-100. The 
cDNA sequence of the open reading frame of PIOOOC, including the stop codon, is 
provided in SEQ ID NO: 787, with the open reading frame without the stop codon being 
provided in SEQ ID NO: 788. The full-length amino acid sequence of PIOOOC is 
5 provided in SEQ ID NO: 789. SEQ ID NO: 790 and 791 represent amino acids 1-100 
and 100-492 of PIOOOC, respectively. 

The expression of the polypeptide encoded by the full length cDNA 
sequence for Fl-12 (also referred to as P504S; SEQ ID NO: 108) was investigated by 
immunohistochemical analysis. Rabbit-anti-P504S polyclonal antibodies were 

10 generated against the full length P504S protein by standard techniques. Subsequent 
isolation and characterization of the polyclonal antibodies were also performed by 
techniques well known in the art. Immunohistochemical analysis showed that the 
P504S polypeptide was expressed in 100% of prostate carcinoma samples tested (n=5). 

The rabbit-anti-P504S polyclonal antibody did not appear to label benign 

15 prostate cells with the same cytoplasmic granular staining, but rather with light nuclear 
staining. Analysis of normal tissues revealed that the encoded polypeptide was found to 
be expressed in some, but not all normal human tissues. Positive cytoplasmic staining 
with rabbit-anti-P504S polyclonal antibody was found in normal hmnan kidney, liver, 
brain, colon and lung-associated macrophages, whereas heart and bone marrow were 

20 negative. 

This data indicates that the P504S polypeptide is present in prostate 
cancer tissues, and that there are qualitative and quantitative differences in the staining 
between benign prostatic hyperplasia tissues and prostate cancer tissues, suggesting that 
this polypeptide may be detected selectively in prostate tumors and therefore be usefiil 
25 in the diagnosis of prostate cancer. 
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EX.A.MPLE 3 

Isolation Akd Characterization Of Prostate-Specific 
Polypeptides By Pcr-Based Subtraction 



5 A cDNA subtraction librar\\ containing cDNA from normal prostate 

subtracted with ten other normal tissue cDNAs (brain, heart, kidney, liver, lung, ovary, 
placenta, skeletal muscle, spleen and thymus) and then submitted to a first round of 
PCR amplification, was purchased from Clontech. This library^ was subjected to a 
second round of PCR amplification, following the manufacturer's protocol. The 
1 0 resuhing cDNA fragments were subcloned into the vector pT7 Blue T-vector (Novagen, 
Madison, Wl) and transformed into XL-1 Blue MRF' E. coli (Stratagene). DNA was 
isolated from independent clones and sequenced using a Perkin Elmer/Applied 
Biosy stems Division Automated Sequencer Model 3 73 A. 

Fifty-nine positive clones were sequenced. Comparison of the DNA 

15 sequences of these clones with those in the gene bank, as described above, revealed no 
significant homologies to 25 of these clones, hereinafter referred to as P5, P8, P9, P18, 
P20, P30, P34, P36, P38, P39, P42, P49, P50, P53, P55, P60, P64, P65, P73, P75, P76, 
P79 and P84. The determined cDNA sequences for these clones are provided in SEQ 
ID NO: 41-45, 47-52 and 54-65, respectively. P29, P47, P68, P80 and P82 (SEQ ID 

20 NO: 46, 53 and 66-68, respectively) were found to show some degree of homology to 
previously identified DNA sequences. To the best of the inventors' knowledge, none of 
these sequences have been previously shown to be present in prostate. 

Further studies employing the sequence of SEQ ID NO: 67 as a probe in 
standard full-length cloning methods, resulted in the isolation of tliree cDNA sequences 

25 which appear to be splice variants of P80 (also known as P704P). These sequences are 
provided in SEQ ID NO: 620-622. 

Further studies using the PCR-based methodology described above 
resulted in the isolation of more than 180 additional clones, of which 23 clones were 
found to show no significant homologies to known sequences. The determined cDNA 

30 sequences for these clones are provided in SEQ ID NO: 115-123, 127, 131, 137, 145, 
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147-151, 153, 156-158 and 160. Twenty-three clones (SEQ ID NO: 124-126, 128-130, 
132-136, 138-144, 146, 152, 154, 155 and 159) were found to show some homology to 
previously identified ESTs. An additional ten clones (SEQ ID NO: 161-170) were 
found to have some degree of homology to known genes. Larger cDNA clones 
5 containing the P20 sequence represent spHce variants of a gene referred to as P703P. 
The determined DNA sequence for the variants referred to as DEI, DEI 3 and DEI 4 are 
provided in SEQ ID NOS: 171, 175 and 177, respectively, with the corresponding 
predicted amino acid sequences being provided in SEQ ID NO: 172, 176 and 178, 
respectively. The determined cDNA sequence for an extended spliced fomi of P703 is 

10 provided in SEQ ID NO: 225. The DNA sequences for the splice variants referred to as 
DE2 and DE6 are provided in SEQ ID NOS: 173 and 174, respectively. 

nxRNA Expression levels for representative clones in tumor tissues 
(prostate (n=5), breast (n=2), colon and lung) normal tissues (prostate (n=5)5 colon, 
kidney, liver, lung (n=2), ovary (n==2), skeletal muscle, skin, stomach, small intestine 

15 and brain), and activated and non-activated PBMC was determined by RT-PCR as 
described above. Expression was examined in one sample of each tissue type unless 
otherwise indicated. 

P9 was found to be highly expressed in normal prostate and prostate 
tumor compared to all normal tissues tested except for normal colon which showed 

20 comparable expression. P20, a portion of the P703P gene, was found to be highly 
expressed in normal prostate and prostate tumor, compared to all twelve normal tissues 
tested. A modest increase in expression of P2D in breast tumor (n=2), colon tumor and 
lung tumor was seen compared to all normal tissues except lung (1 of 2). Increased 
expression of PI 8 was found in nonnal prostate, prostate tumor and breast tumor 

25 compared to other normal tissues except lung and stomach. A modest increase in 
expression of P5 was observed in normal prostate compared to most other normal 
tissues. However, some elevated expression was seen in normal lung and PBMC. 
Elevated expression of P5 was also observed in prostate tumors (2 of 5), breast tumor 
and one lung tumor sample. For P30, similar expression levels were seen in normal 

30 prostate and prostate tumor, compared to six of twelve other nonnal tissues tested. 
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Increased expression was seen in breast tumors, one lung tumor sample and one colon 
tumor sample, and also in normal PBMC. P29 was found to be over-expressed in 
prostate tumor (5 of 5) and normal prostate (5 of 5) compared to the majorit>' of normal 
tissues. However, substantial expression of P29 was obser\'ed in normal colon and 
5 normal lung (2 of 2). P80 was found to be over-expressed in prostate tumor (5 of 5) and 
normal prostate (5 of 5) compared to all other norma] tissues tested, with increased 
expression also being seen in colon tumor. 

Further studies resulted in the isolation of twelve additional clones, 
hereinafter referred to as 10-d8, 10-hlO, ll-c8, 7-g6, 8-b5, 8-b6, 8-d4, 8-d9. 8-g3. 8- 
10 hll, 9-fl2 and 9-f3. The determined DNA sequences for lO-dS, 10-hlO, 1 ]-c8, 8-d4, 8- 
d9, 8-hl 1, 9-n2 and 9-f3 are provided in SEQ ID NO: 207, 208, 209, 216, 217, 220, 
221 and 222, respectively. The detem^ined forward and reverse DNA sequences for 7- 
g6. 8-b5, 8-b6 and 8-g3 are provided in SEQ ID NO: 210 and 211; 212 and 213; 214 
and 215; and 218 and 219, respectively. Comparison of these sequences with those in 
15 the gene bank revealed no significant homologies to the sequence of 9~f3. The clones 
10-d8, ll-c8 and 8-hll were found to show some homology to previously isolated 
ESTs, while 10-hlO, S-b5, 8-b6, 8-d4, 8-d9, 8-g3 and 9-fl2 were found to show some 
homology to previously identified genes. Further characterization of 7-G6 and 8-G3 
showed identity to the known genes PAP and PSA, respectively. 

mRNA expression levels for these clones were determined using the 
micro-array technology described above. The clones 7-G6, 8-G3, 8-B5, 8-B6, 8-D4, 8- 
D9, 9-F3, 9-F12, 9-H3, 10-A2, 10-A4, 11-C9 and 11-F2 were found to be over- 
expressed in prostate tumor and normal prostate, with expression in other tissues tested 
being low or undetectable. Increased expression of 8-Fll w^as seen in prostate tumor 
IS and normal prostate, bladder, skeletal muscle and colon. Increased expression of 10- 
HIO was seen in prostate tumor and normal prostate, bladder, lung, colon, brain and 
large intestine. Increased expression of 9-Bl w^as seen in prostate tumor, breast tumor, 
and normal prostate, salivary gland, large intestine and skin, with increased expression 
of 1 1-C8 being seen in prostate tumor, and nonnal prostate and large intestine. 
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An additional cDNA fragment derived from the PCR-based normal 
prostate subtraction, described above, was found to be prostate specific by both micro- 
array technology and RT-PCR. The determined cDNA sequence of this clone (referred 
to as 9-Al 1) is provided in SEQ ID NO: 226. Comparison of this sequence with those 
5 in the public databases revealed 99% identity to the known gene HOXB13. 

Further studies led to the isolation of the clones 8-C6 and 8-H7. The 
determined cDNA sequences for these clones are provided in SEQ ID NO: 227 and 228, 
respectively. These sequences were found to show some homology to previously 
isolated ESTs. 

10 PGR and hybridization-based methodologies were employed to obtain 

longer cDNA sequences for clone P20 (also referred to as P703P), yielding three 
additional cDNA fragments that progressively extend the 5' end of the gene. These 
fragments, referred to as P703PDE5, P703P6.26, and P703PX-23 (SEQ ID NO: 326, 
328 and 330, with the predicted corresponding amino acid sequences being provided in 

15 SEQ ID NO: 327, 329 and 331, respectively) contain additional 5' sequence. 
P703PDE5 was recovered by screening of a cDNA library (#141-26) with a portion of 
P703P as a probe. P703P6.26 was recovered from a mixture of three prostate tumor 
cDNAs and P703PX_23 was recovered from cDNA library (#438-48). Together, the 
additional sequences include all of the putative mature serine protease along with part of 

20 the putative signal sequence. The full-length cDNA sequence for P703P is provided in 
SEQ ID NO: 524, with the corresponding amino acid sequence being provided in SEQ 
ID NO: 525. 

Using computer algorithms, the following regions of P703P were 
predicted to represent potential HLA A2-binding CTL epitopes: amino acids 164-172 

25 of SEQ ID NO: 525 (SEQ ID NO: 723); amino acids 160-168 of SEQ ID NO: 525 
(SEQ ID NO: 724); amino acids 239-247 of SEQ ID NO: 525 (SEQ ID NO: 725); 
amino acids 1 18-126 of SEQ ID NO: 525 (SEQ ID NO: 726); amino acids 1 12-120 of 
SEQ ID NO: 525 (SEQ ID NO: 727); amino acids 155-164 of SEQ ID NO: 525 (SEQ 
ID NO: 728); amino acids 117-126 of SEQ ID NO: 525 (SEQ ID NO: 729); amino acids 

30 164-173 of SEQ ID NO: 525 (SEQ ID NO: 730); amino acids 154-163 of SEQ ID NO: 



BNSDOCID <WO 0151633A2 I > 



wo 01 51633 



PC T l S(M/01574 



135 

525 (SEQ ID NO: 731): amino acids 163-172 of SEQ ID NO: 525 (SEQ ID NO: 732): 
amino acids 58-66 of SEQ ID NO: 525 (SEQ ID NO: 733); and amino acids 59-67 of 
SEQ ID NO: 525 (SEQ ID NO: 734). 

P703P was found to show some homology to previously identified 
5 proteases, such as thrombin. The thrombin receptor has been shown to be preferentially 
expressed in highly metastatic breast carcinoma cells and breast carcinoma biopsy 
samples. Introduction of thrombin receptor antisense cDNA has been shown to inhibit 
the invasion of metastatic breast carcinoma cells in culture. Antibodies against 
thrombin receptor inliibit thrombin receptor activation and thrombin-induced platelet 
10 activation. Furthermore, peptides that resemble the receptor's tethered ligand domain 
inhibit platelet aggregation by thrombin. P703P may play a role in prostate cancer 
through a protease-activated receptor on the cancer cell or on stromal cells. The 
potential tr>'psin-like protease activity of P703P may either activate a protease-activated 
receptor on the cancer cell membrane to promote tumorgenesis or activate a protease- 
15 activated receptor on the adjacent cells (such as stromal cells) to secrete growth factors 
and/or proteases (such as matrix metalloproteinases) that could promote tumor 
angiogenesis, invasion and metastasis. P703P may thus promote tumor progression 
and/or metastasis through the activation of protease-activated receptor. Polypeptides 
and antibodies that block the P703P-receptor interaction may therefore be usefully 
20 employed in the treatment of prostate cancer. 

To determine whether P703P expression increases with increased 
severity of Gleason grade, an indicator of tumor stage, quantitative PGR analysis was 
performed on prostate tumor samples with a range of Gleason scores from 5 to > 8. The 
mean level of P703P expression increased with increasing Gleason score, indicating that 
25 P703P expression may correlate with increased disease severity. 

Further studies using a PCR-based subtraction library of a prostate tumor 
pool subtracted against a pool of normal tissues (referred to as JP: PGR subtraction) 
resulted in the isolation of thirteen additional clones, seven of which did not share any 
significant homology to known GenBank sequences. The determined cDNA sequences 
30 for these seven clones (P71 IP, P712P, novel 23, P774P, P775P, P710P and P768P) are 
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provided in SEQ ID NO: 307-31 1, 313 and 315, respectively. The remaining six clones 
(SEQ ID NO: 316 and 321-325) were shown to share some homology to known genes. 
By microarray analysis, all thirteen clones showed three or more fold over-expression in 
prostate tissues, including prostate tumors, BPH and normal prostate as compared to 
5 normal non-prostate tissues. Clones P711P, P712P, novel 23 and P768P showed over- 
expression in most prostate tumors and BPH tissues tested (n=29), and in the majority 
of normal prostate tissues (n=4), but background to low expression levels in all normal 
tissues. Clones P774P, P775P and P710P showed comparatively lower expression and 
expression in fewer prostate tumors and BPH samples, with negative to low expression 
10 in normal prostate. 

Further studies led to the isolation of an extended cDNA sequence for 
P712P (SEQ ID NO: 552). The amino acid sequences encoded by 16 predicted open 
reading frames present within the sequence of SEQ ID NO: 552 are provided in SEQ ID 
NO: 553-568. 

15 The full-length cDNA for P71 IP was obtained by employing the partial 

sequence of SEQ ID NO: 307 to screen a prostate cDNA library. Specifically, a 
directionally cloned prostate cDNA library was prepared using standard techniques. 
One million colonies of this library were plated onto LB/Amp plates. Nylon membrane 
filters were used to lift these colonies, and the cDNAs which were picked up by these 

20 filters were denatured and cross-linked to the filters by UV light. The P711P cDNA 
fragment of SEQ ID NO: 307 was radio-labeled and used to hybridize with these filters. 
Positive clones were selected, and cDNAs were prepared and sequenced using an 
automatic Perkin Elmer/Applied Biosystems sequencer. The determined full-length 
sequence of P71 IP is provided in SEQ ID NO: 382, with the corresponding predicted 

25 amino acid sequence being provided in SEQ ID NO: 383. 

Using PCR and hybridization-based methodologies, additional cDNA 
sequence information was derived for two clones described above, 11 -C9 and 9-F3, 
herein after referred to as P707P and P714P, respectively (SEQ ID NO: 333 and 334). 
After comparison with the most recent GenBank, P707P was found to be a splice 

30 variant of the known gene HoxB13. In contrast, no significant homologies to P714P 
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were found. Further studies employing the sequence of SEQ ID NO: 334 as a probe in 
standard full-length cloning methods, resulted in an extended cDNA sequence for 
P714P. 1 his sequence is provided in SEQ ID NO: 619. This sequence was found to 
show some homology to the gene that encodes human ribosomal L23 A protein. 
5 Clones 8-B3, P89, P98, P130 and P201 (as disclosed in U.S. Patent 

Application No. 09/020,956, filed February 9, 1998) were found to be contained within 
one contiguous sequence, referred to as P705P (SEQ ID NO: 335, with the predicted 
amino acid sequence provided in SEQ ID NO: 336), which w^as determined to be a 
splice variant of the known gene NKX 3.1. 
^0 f urther studies on P775P resulted in the isolation of four additional 

sequences {SFO ID NO: 473-476) which are all splice variants of the P775P gene. The 
sequence of SPQ ID NO: 474 was found to contain two open reading frames (ORPs). 
The predicted amino acid sequences encoded by these ORFs are provided in SEQ ID 
NO: 477 and 478. 1 he cDNA sequence of SEQ ID NO: 475 was found to contain an 
15 ORF wliich encodes the amino acid sequence of SEQ ID NO: 479. The cDNA 
sequence of SEQ ID NO: 473 was found to contain four ORFs. The predicted amino 
acid sequences encoded by these ORFs are provided in SEQ ID NO: 480-483. 
Additional splice variants of P775P are provided in SEQ ID NO: 593-597. 

Subsequent studies led to the identification of a genomic region on 
20 chromosome 22ql 1.2, knowTi as the Cat Eye Syndrome region, that contains the five 
prostate genes P704P, P712P, P774P, P775P and B305D. The relative location of each 
of these five genes within the genomic region is shown in Fig. 10. This region may 
therefore be associated with malignant tumors, and other potential tumor genes may be 
contained within this region. These studies also led to the identification of a potential 
25 open reading frame (ORF) for P775P (provided in SEQ ID NO: 533), which encodes 
the amino acid sequence of SEQ ID NO: 534. 

Comparison of the clone of SEQ ID NO: 325 (referred to as P558S) with 
sequences in the GenBank and GeneSeq DNA databases showed that P558S is identical 
to the prostate-specific transglutaminase gene, which is known to have two forms. The 
"^0 full-length sequences for the two forms are provided in SEQ ID NO: 630 and 631, with 
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the corresponding amino acid sequences being provided in SEQ ID NO: 632 and 633, 
respectively. The cDNA sequence of SEQ ID NO: 631 has a 15 pair base insert, 
resulting in a 5 amino acid insert in the corresponding amino acid sequence (SEQ ID 
NO: 633). This insert is not present in the sequence of SEQ ID NO: 630. 
5 Further studies on P768P (SEQ ID NO: 315) led to the identification of 

the putative full-length open reading frame (ORE). The cDNA sequence of the ORE 
with stop codon is provided in SEQ ID NO: 764. The cDNA sequence of the ORE 
without stop codon is provided in SEQ ID NO: 765, with the corresponding amino acid 
sequence being provided in SEQ ID NO: 766. This sequence was found to show 86% 

10 identity to a rat calcium transporter protein, indicating that P768P may represent a 
human calcium transporter protein. The locations of transmembrane domains within 
P768P were predicted using the PSORT II computer algorithm. Six transmembrane 
domains were predicted at amino acid positions 118-134, 172-188, 211-227, 230-246, 
282-298 and 348-364. The amino acid sequences of SEQ ID NO: 767-772 represent 

15 amino acids M34, 135-188, 189-227, 228-246, 247-298 and 299-511 of P768P, 
respectively. 



20 



EXAMPLE 4 
Synthesis of Polypeptides 



Polypeptides may be synthesized on a Perkin Elmer/Applied Biosystems 
43 OA peptide synthesizer using FMOC chemistry with HPTU (O-Benzotriazole- 
N,N,N',N'-tetramethyluronium hexafluorophosphate) activation. A Gly-Cys-Gly 
sequence may be attached to the amino terminus of the peptide to provide a method of 

25 conjugation, binding to an immobilized surface, or labeling of the peptide. Cleavage of 
the peptides from the solid support may be carried out using the following cleavage 
mixture: trifluoroacetic acid:ethanedithiol:thioanisole:water:plienol (40:1:2:2:3). After 
cleaving for 2 hours, the peptides may be precipitated in cold methyl-t-butyl-ether. The 
peptide pellets may then be dissolved in water containing 0.1% trifluoroacetic acid 

30 (TEA) and lyophilized prior to purification by CI 8 reverse phase HPLC. A gradient of 
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0*^0-60% acetonitrile (containing 0.1% TFA) in water (containing 0.1% TFA) may be 
used to elute the peptides. Following lyophilization of the pure fractions, the peptides 
may be chaiacterized using electrospray or other ty pes of mass spectrometr\' and by 
amino acid analysis. 

5 

EX.AMPLE 5 
Further Isolation And Characterization Of 
Prostate-Specific Polypeptides By Pcr-Based Subtflaction 

A cDNA libraiy generated from prostate primary tumor mRNA as 
described above was subtracted with cDNA from normal prostate. The subtraction w^as 
perfonned using a PCR-based protocol (Clontech), which was modified to generate 
larger fragments. Within this protocol, tester and driver double stranded cDNA were 
separately digested with five restriction enzymes that recognize six-nucleotide 
1 5 restriction sites (Mlul, MscI, PvuII, Sail and StuI). This digestion resulted in an average 
cDNA size of 600 bp, rather than the average size of 300 bp that results from digestion 
with Rsal according to the Clontech protocol. This modification did not affect the 
subtraction efficiency. Two tester populations were then created with different 
adapters, and the driver library remained without adapters, 

-f^ The tester and driver libraries were then hybridized using excess driver 

cDNA. In the first hybridization step, driver was separately hybridized with each of the 
two tester cDNA populations. This resulted in populations of (a) unliybridized tester 
cDNAs, (b) tester cDNAs hybridized to other tester cDNAs, (c) tester cDNAs 
hybridized to driver cDNAs and (d) unhybridized driver cDNAs. Ihe two separate 

25 hybridization reactions were then combined, and rehybridized in the presence of 
addifional denatured driver cDNA. Following this second hybridization, in addition to 
populations (a) through (d), a fifth population (e) was generated in which tester cDNA 
v^'ith one adapter hybridized to tester cDNA with the second adapter. Accordingly, the 
second hybridization step resulted in enricliment of differentially expressed sequences 

30 which could be used as templates for PGR amplification with adaptor-specific primers. 
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ITie ends were then filled in, and PCR amplification was performed 
using adaptor-specific primers. Only population (e), which contained tester cDNA that 
did not hybridize to driver cDNA, was amplified exponentially. A second PCR 
amplification step was then performed, to reduce background and further enrich 
5 differentially expressed sequences. 

This PCR-based subtraction technique normalizes differentially 
expressed cDN As so that rare transcripts that are overexpressed in prostate tumor tissue 
may be recoverable. Such transcripts would be difficult to recover by traditional 
subtraction methods. 

10 In addition to genes known to be overexpressed in prostate tumor, 

sevent\'-se\'en further clones were identified. Sequences of these partial cDNAs are 
provided in SEQ ID NO: 29 to 305. Most of these clones had no significant homology 
to database sequences. Exceptions were JPTPN23 (SEQ ID NO: 231; similarity to pig 
valosin-containing protein), JPTPN30 (SEQ ID NO: 234; similarity to rat mKNA for 

15 proteasome subunit), JPTPN45 (SEQ ID NO: 243; similarity to rat jion^egicus cytosolic 
NADP-dcpendent isocitrate dehydrogenase), JPTPN46 (SEQ ID NO: 244; similarity to 
human subclone H8 4 d4 DNA sequence), JP1D6 (SEQ ID NO: 265; similarity to G. 
galliis dynein light chain-A), JP8D6 (SEQ ID NO: 288; similarity to human BAG clone 
RG016J04), JP8F5 (SEQ ID NO: 289; similarity to human subclone H8 3 b5 DNA 

20 sequence), and JP8E9 (SEQ ID NO: 299; similarity to human Alu sequence). 

Additional studies using the PCR-based subtraction library consisting of 
a prostate tumor pool subtracted against a normal prostate pool (referred to as PT-PN 
PCR subtraction) yielded three additional clones. Comparison of the cDNA sequences 
of these clones w^ith the most recent release of GenBank revealed no significant 

25 homologies to the two clones referred to as P715P and P767P (SEQ ID NO: 312 and 
314). The remaining clone was found to show some homology to the known gene 
KIAA0056 (SEQ ID NO: 318). Using microarray analysis to measure mRNA 
expression levels in various tissues, all tliree clones were found to be over-expressed in 
prostate tumors and BPH tissues. Specifically, clone P715P was over-expressed in most 

30 prostate tumors and BPH tissues by a factor of three or greater, with elevated expression 
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seen in the majority of nonnal prostate samples and in fetal tissue, but negative to low- 
expression in all other normal tissues. Clone P767P was over-expressed in several 
prostate tumors and BPH tissues, w ith moderate expression levels in half of the nonnal 
prostate samples, and background to low expression in all other nonnal tissues tested. 
> Further analysis, by microairay as described above, of the PT-PN PCR 

subtraction library and of a DNA subtraction librar}^ containing cDNA from prostate 
tumor subtracted with a pool of nonnal tissue cDNAs, led to the isolation of 27 
additional clones (SEQ ID NO: 340-365 and 381) which were determined to be over- 
expressed in prostate tumor. The clones of SEQ ID NO: 341, 342, 345, 347, 348, 349, 

10 35K 355-359, 36L 362 and 364 w^ere also found to be expressed in nonnal prostate. 
Expression of all 26 clones in a variety of normal tissues was found to be low or 
undetectable, with the exception of P544S (SEQ ID NO: 356) which was found to be 
expressed in small intestine. Of the 26 clones, 1 1 (SEQ ID NO: 340-349 and 362) w^ere 
found to show some homology to previously identified sequences. No significant 

1 5 homologies were found to the clones of SEQ ID NO: 350, 351, 353-361, and 363-365. 

Comparison of the sequence of SEQ ID NO: 362 with sequences in the 
GenBank and GeneSeq DNA databases showed that this clone (referred to as P788P) is 
identical to GeneSeq Accession No. X27262, which encodes a protein found in the 
GeneSeq protein Accession No. Y00931. The full length cDNA sequence of P788P is 

20 provided in SEQ ID NO: 634, with the corresponding predicted amino acid being 
provided in SEQ ID NO: 635. Subsequently, a full-length cDNA sequence for P788P 
that contains polymorphisms not found in the sequence of SEQ ID NO: 634, was cloned 
multiple times by PCR amplification from cDNA prepared from several RNA templates 
from three individuals. This determined cDNA sequence of this polymorphic variant of 

25 P788P is provided in SEQ ID NO: 636, with the corresponding amino acid sequence 
being provided in SEQ ID NO: 637. The sequence of SEQ ID NO: 637 differs from 
that of SEQ ID NO: 635 by six amino acid residues. The P788P protein has 7 potential 
transmembrane domains at the C-terminal portion and is predicted to be a plasma 
membrane protein with an extracellular N-terminal region. 
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Further studies on the clone of SEQ ID NO: 352 (referred to as P790P) 
led to the isolation of the full-length cDNA sequence of SEQ ID NO: 526. The 
corresponding predicted amino acid is provided in SEQ ID NO: 527. Data from two 
quantitative PGR experiments indicated that P790P is over-expressed in 11/15 tested 
5 prostate tumor samples and is expressed at low levels in spinal cord, with no expression 
being seen in all other normal samples tested. Data from further PGR experiments and 
microarray experiments showed over-expression in normal prostate and prostate tumor 
with Httle or no expression in other tissues tested. P790P was subsequently foimd to 
show significant homology to a previously identified G-protein coupled prostate tissue 
1 0 receptor. 

Additional studies on the clone of SEQ ID NO: 354 (referred to as 
P776P) led to the isolation of an extended cDNA sequence, provided in SEQ ID NO: 
569. The determined cDNA sequences of three additional splice variants of P776P are 
provided in SEQ ID NO: 570-572. The amino acid sequences encoded by two predicted 

15 open reading frames (ORFs) contained within SEQ ID NO: 570, one predicted ORE 
contained within SEQ ID NO: 571, and 11 predicted ORFs contained within SEQ ID 
NO: 569, are provided in SEQ ID NO: 573-586, respectively. Further studies led to the 
isolation of the full-length sequence for the clone of SEQ ID NO: 570 (provided in SEQ 
ID NO: 737). Full-length cloning efforts on the clone of SEQ ID NO: 571 led to the 

20 isolation of two sequences (provided in SEQ ID NO: 738 and 739), representing a 
single clone, that are identical with the exception of a polymorphic insertion/deletion at 
position 1293. Specifically, the clone of SEQ ID NO: 739 (referred to as clone Fl) has 
a C at position 1293. The clone of SEQ ID NO: 738 (referred to as clone F2) has a 
single base pair deletion at position 1293. The predicted amino acid sequences encoded 

25 by 5 open reading frames located within SEQ ID NO: 737 are provided in SEQ ID NO: 
740-744, with the predicted amino acid sequences encoded by the clone of SEQ ID NO: 
738 and 739 being provided in SEQ ID NO: 745-750. 

Gomparison of the cDNA sequences for the clones P767P (SEQ ID NO: 
314) and P777P (SEQ ID NO: 350) with sequences in the GenBank human EST 

30 database showed that the two clones matched many EST sequences in conmion, 
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suggesting that P767P and P777P may represent the same gene. A DNA consensus 
sequence derived from a DNA sequence alignment of P767P, P777P and multiple EST 
clones is provided in SEQ ID NO: 587. The amino acid sequences encoded by three 
putative ORFs located within SEQ ID NO: 587 are provided in SEQ ID NO: 588-590. 

The clone of SEQ ID NO: 342 (referred to as P789P ) was found to show 
homology to a previously identified gene. Ihe full length cDNA sequence for P789P 
and the corresponding amino acid sequence are provided in SEQ ID NO: 735 and 736, 
respectively. 

EXAMPLE 6 

Peptide Priming Of Mice And Propagation Of Ctl Lines 

6.1. This Example illustrates the preparation of a CTL cell line specific 
for cells expressing the P502S gene. 

Mice expressing the transgene for human HLA A2Kb (provided by Dr L. 
Sherman, The Scripps Research Institute, La Jolla, CA) were immunized with P2S#12 
peptide (VLGWAEL; SEQ ID NO: 306), which is derived from the P502S gene (also 
referred to herein as Jl-17, SEQ ID NO: 8), as described by Theobald et ah, Proc. Natl. 
Acad, Scl USA P2:11993-11997, 1995 with the following modifications. Mice were 
immunized with 100|ag of P2S#I2 and 120|Ag of an I-A^ binding peptide derived from 
hepatitis B Virus protein emulsified in incomplete Freund's adjuvant. Three weeks later 
these mice were sacrificed and using a nylon mesh single cell suspensions prepared. 
Cells were then resuspended at 6 x 10^ cells/ml in complete media (RPMI-1640; Gibco 
BI^, Gaithersburg, MD) containing 10% PCS, 2mM Glutamine (Gibco BRL), sodium 
pyruvate (Gibco BRL), non-essential amino acids (Gibco BRL), 2 x 10'^ M 2- 
mercaptoethanol, 50U/ml penicillin and streptomycin, and cultured in the presence of 
irradiated (3000 rads) P2S^/12-puIsed (5mg/ml P2S#12 and lOmg/ml p2-microglobulin) 
LPS blasts (A2 transgenic spleens cells cultured in the presence of 7|ag/ml dextran 
sulfate and 25^g/ml LPS for 3 days). Six days later, cells (5 x 10^/ml) were 
rcstunulated with 2.5 x 10^/mI peptide pulsed irradiated (20,000 rads) EL4A2Kb cells 



wo 01/51633 



PCT/lJS(n/01574 



144 

(Shennan et al. Science 25^:815-818, 1992) and 3 x 10^/ml A2 transgenic spleen feeder 
cells. Cells were cultured in the presence of 20U/ml IL~2. Cells continued to be 
restimulated on a weekly basis as described, in preparation for cloning the line. 

P2S#12 line was cloned by limiting dilution analysis with peptide pulsed 
5 EL4 A2Kb tumor cells (1 x lO"* cells/ well) as stimulators and A2 transgenic spleen 
cells as feeders ( 5 x 10^ cells/ well) grown in the presence of 30U/ml lL-2. On day 14, 
cells were restimulated as before. On day 21, clones that were growing were isolated 
and maintained in culture. Several of tliese clones demonstrated significantly higher 
reactivity (lysis) against human fibroblasts (HLA A2Kb expressing) transduced with 
1 0 P502S than against control fibroblasts. An example is presented in Figure 1 . 

This data indicates that P2S #12 represents a naturally processed epitope 
of the P502S protein that is expressed in the context of the human HLA A2Kb 
molecule. 

1 5 6.2. This Example illustrates the preparation of murine CTL lines and 

CTL clones specific for cells expressing the P501S gene. 

This series of experiments were performed similarly to that described 
above. Mice were immunized with the P1S#10 peptide (SEQ ID NO: 337), which is 

20 derived from the P501S gene (also referred to herein as Ll-12, SEQ ID NO: 110). The 
P1S#10 peptide was derived by analysis of the predicted polypeptide sequence for 
P501S for potential HLA-A2 binding sequences as defined by published HLA-A2 
binding motifs (Parker, KG, et al, J. hnmunoL, 152:163, 1994). P1S#10 peptide was 
synthesized as described in Example 4, and empirically tested for HLA-A2 binding 

25 using a T cell based competition assay. Predicted A2 binding peptides were tested for 
their ability to compete HLA-A2 specific peptide presentation to an HLA-A2 restricted 
CTL clone (D150M58), which is specific for the HLA-A2 binding influenza matrix 
peptide fluM58. D150M58 CTL secretes TNF in response to self-presentation of 
peptide fluM58. In the competition assay, test peptides at 100-200 p.g/ml were added to 

30 cultures of D150M58 CTL in order to bind HLA-A2 on the CTL. After thirty minutes. 
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C TL cultured with test peptides, or control peptides, were tested for their ciiitigen dose 
response to the fluM5S peptide in a standard TNF bioassay. As shown in Figure 3, 
peptide P]S#10 competes HLA-A2 restricted presentation of fIuM58, demonstrating 
that peptide PI S#] 0 binds HLA-A2. 

5 Mice expressing the transgene for human HLA A2Kb were immunized 

as described by Theobald et al. {Proc. Natl Acad Sd. USA 92:11993-11997, 1995) 
with the following modifications. Mice were immunized with 62.5pg of PIS #10 and 
120|ig of an 1-A^ binding peptide derived from Hepatitis B Virus protein emulsified in 
incomplete Freund's adjuvant. Three weeks later these mice were sacrificed and single 

0 cell suspensions prepared using a nylon mesh. Cells were then resuspended at 6 x 10^ 
cells/ml in complete media (as described above) and cultured in the presence of 
irradiated (3000 rads) PI S#10-pulsed {2jig/xn\ P1S#10 and lOmg/ml p2-microglobulin) 
LPS blasts (A2 transgenic spleens cells cultured in the presence of 7|ig/ml dextran 
sulfate and 25iag/ml LPS for 3 days). Six days later cells (5 x 10^/ml) were reslimulated 

5 with 2.5 X lOVml peptide-pulsed irradiated (20,000 rads) EL4A2Kb cells, as described 
above, and 3 x lOVml A2 transgenic spleen feeder cells. Cells were cultured in the 
presence of 20 U/ml IL-2. Cells were restimulated on a weekly basis in preparation for 
cloning. After three rounds of in vitro stimulations, one line was generated that 
recognized PlS#10-pulsed Jurkat A2Kb targets and P501S-transduced Jurkat targets as 

0 shown in Figure 4. 

A PlS#10-specific CTL line was cloned by limiting dilution analysis 
with pepfide pulsed EL4 A2Kb tumor cells (1 x lO'^ cells/ well) as stimulators and A2 
transgenic spleen cells as feeders (5 x 10^ cells/ well) growxi in the presence of 30U/ml 
{L-2. On day 14, cells were restimulated as before. On day 21, viable clones were 

5 isolated and maintained in culture. As shown in Figure 5, five of these clones 
demonstrated specific cytolytic reactivity against P501S-transduced Jurkat A2Kb 
targets. This data indicates that P1S#10 represents a naturally processed epitope of the 
P50 IS protein that is expressed in the context of the human HLA-A2.1 molecule. 
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EXAMPLE 7 

Priming of ctl in vivo using naked dna immunization 

With a prostate antigen 
The prostate-specific antigen LI -12, as described above, is also referred 
5 to as P501S. HLA A2Kb Tg mice (provided by Dr L. Sherman, The Scripps Research 
Institute, La Jolla, CA) were immunized with 100 \xg P501S in the vector VR1012 
either intramusculaiiy or intradermally. The mice were immimized three times, with a 
two week interval between immunizations. Two weeks after the last immunization, 
immune spleen cells were cultured with Jurkat A2Kb-P501S transduced stimulator 
10 cells. CTL lines were stimulated weekly. After two weeks of in vitro stimulation, CTL 
activity was assessed against P501S transduced targets. Two out of 8 mice developed 
strong anti-P501S CTL responses. These results demonstrate that P501S contains at 
least one naturally processed HLA-A2-restricted CTL epitope. 

15 EXAMPLE 8 

Ability Of Human T Cells To Recognize Prostate-Specific Polypeptides 

This Example illustrates the ability of T cells specific for a prostate 
tumor polypeptide to recognize human tumor. 

20 Human CD 8"^ T cells were primed in vitro to the P2S-12 peptide (SEQ 

ID NO: 306) derived from P502S (also referred to as Jl-17) using dendritic cells 
according to the protocol of Van Tsai et al. (Critical Reviews in Immunology 75:65-75, 
1998). The resulting CD8^ T cell microcultures were tested for their ability to 
recognize the P2S-12 peptide presented by autologous fibroblasts or fibroblasts which 

25 were transduced to express the P502S gene in a y-interferon ELISPOT assay (see 
Lalvani et ah, J. Exp. Med 756:859-865, 1997). Briefly, titrating numbers of T cells 
were assayed in duplicate on 10'* fibroblasts in the presence of 3 |ig/m] human P2- 
microglobulin and I fig/ml P2S-12 peptide or control E75 peptide: In addition, T cells 
were simultaneously assayed on autologous fibroblasts transduced with the P502S gene 

30 or as a control, fibroblasts transduced with HER-2/;?ew. Prior to the assay, the 
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fibroblasts were treated with 10 ng/ml y-interferon for 48 hours to upregulale class I 
MHC expression. One of the microcultures (#5) demonstrated strong recognition of 
both peptide pulsed fibroblasts as well as transduced fibroblasts in a y-interferon 
ELISPOT assay. Figure 2A demonstrates that there was a strong increase in the number 
5 of y-interferon spots with increasing numbers of T cells on fibroblasts pulsed with the 
P2S-12 peptide (solid bars) but not with the control E75 peptide (open bars). This 
show^s the ability of these T cells to specifically recognize the P2S-12 peptide. As 
showTi in Figure 2B, this microculture also demonstrated an increase in the number of y- 
interferon spots with increasing numbers of T cells on fibroblasts transduced to express 

10 the P502S gene but not the HER-l/neu gene. These results provide additional 
confirmatory evidence that the P2S-12 peptide is a naturally processed epitope of the 
P502S protein. Furthermore, this also demonstrates that there exists in the human T cell 
repertoire, high affinity T cells which are capable of recognizing this epitope. These T 
cells should also be capable of recognizing human tumors which express the P502S 

15 gene. 

EXAMPLE 9 

Elicitation Of Prostate Antigen-Specific Ctl Responses 
In Human Blood 

20 

This Example illustrates the ability of a prostate-specific antigen to elicit 
a CI L response in blood of normal humans. 

Autologous dendritic cells (DC) were differentiated from monocyte 
cultures derived from PBMC of normal donors by growth for five days in RPMI 

25 medium containing 10% human serum, 50 ng/ml GMCSF and 30 ng/mJ IL-4. 
Follovv'ing culture, DC were infected overnight with recombinant P501 S-expressing 
vaccinia virus at an M.O.L of 5 and matured for 8 hours by the addition of 2 
micrograms/ml CD40 ligand. Virus was inactivated by UV irradiation, CD8^ cells were 
isolated by positive selection using magnetic beads, and priming cultures were initiated 

30 m 24-well plates. Following five stimulation cycles using autologous fibroblasts 
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retrovirally transduced to express P501S and CD80, CD8-f lines were identified that 
specifically produced interferon-gamma when stimulated with autologous P501S- 
transduced fibroblasts. The P501S-specific activity of cell line 3A-1 could be 
maintained following additional stimulation cycles on autologous B-LCL transduced 
5 with P501S. Line 3A-1 was shown to specifically recognize autologous B-LCL 
transduced to express P501S, but not EGFP-transduced autologous B-LCL, as measured 
by cytotoxicity assays (^*Cr release) and interferon-gamma production (Interferon- 
gamma Elispot; see above and Lalvani et al., J. Exp. Med J 86:859-865, 1997). The 
results of these assays are presented in Figures 6A and 6B. 

10 

EXAMPLE 10 

Identification Of A Naturally Processed Ctl Epitope Contained Within the 

Prostate-Specific Antigen P703P 

15 The 9-mer peptide p5 (SEQ ID NO: 338) was derived from the P703P 

antigen (also referred to as P20). The p5 peptide is immunogenic in human HLA-A2 
donors and is a naturally processed epitope. Antigen specific himaan CD8+ T cells can 
be primed following repeated //? vitro stimulations with monocytes pulsed with p5 
peptide. These CTL specifically recognize p5-pulsed and P703P-transduced target cells 

20 in both ELISPOT (as described above) and chromium release assays. Additionally, 
immunization of HLA-A2Kb transgenic mice with p5 leads to the generation of CTL 
lines which recognize a variety of HLA-A2Kb or HLA-A2 transduced target cells 
expressing P703P. 

Initial studies demonstrating that p5 is a naturally processed epitope were 

25 done using HLA-A2Kb transgenic mice. HLA-A2Kb transgenic mice were immunized 
subcutaneously in the footpad with 100 ^ig of p5 peptide together with 140 yig of 
hepatitis B virus core peptide (a Th peptide) in Freund's incomplete adjuvant. Three 
weeks post immunization, spleen cells from immunized mice were stimulated in vitro 
with peptide-pulsed LPS blasts. CTL activity was assessed by chromium release assay 

30 five days after primary //? vitro stimulation. Retrovirally transduced cells expressing the 
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control antigen P7()3P and HLA-A2Kb were used as targets. CTL lines that specifically 

rccogni/eJ both p5-pulsed targets as well as P703P-expressing targets w^ere identified. 

Human in vitro priming experiments demonstrated that the p5 peptide is 

immunogenic in humans. Dendritic cells (DC) were differentiated from monocyte 

cultures dcri\ed from PBMC of normal human donors by culturing for five days in 

IIPMI medium containing 10% human serum, 50 ng/ml human GM-CSF and 30 ng/ml 

human IL-4. Following culture, the DC were pulsed with 1 ug/ml p5 peptide and 

cultured \^ith CDK^ T cell enriched PBMC. CTL lines were restimulated on a weekly 

basis uiih p5-pulsed monocytes. Five to six weeks after initiation of the CTL cultures, 

10 CI L rcci>gniiion of p5-pulsed target cells was demonstrated. CTL were additionally 

shown (o recogni/jc human cells transduced to express P703P, demonstrating that p5 is 

a natural 1\ prcKcssed epitope. 

Studies identifying a further peptide epitope (referred to as peptide 4) 

derived Irom the prostate tumor-specific antigen P703P that is capable of being 

\5 recognized by CD4 T cells on the surface of cells in the context of HLA class II 
molecules were carried out as follows. The amino acid sequence for peptide 4 is 
provided in SEQ ID NO: 638, with the corresponding cDNA sequence being provided 
in SEQ ID NO: 639. 

Twenty 15-mer peptides overlapping by 10 amino acids and derived 

20 from the carboxy-terminal fragment of P703P were generated using standard 
procedures. Dendritic cells (DC) were derived from PBMC of a normal female donor 
using GM-CSF and IL-4 by standard protocols. CD4 T cells were generated from the 
same donor as the DC using MACS beads and negafive selection. DC w^ere pulsed 
overnight w ith pools of the 15-mer peptides, with each peptide at a final concentration 

25 of 0.25 microgram/ml. Pulsed DC were washed and plated at 1 x 10^ cells/well of 96- 
well V-bottom plates and purified CD4 T cells were added at 1 x loVwell. Cultures 
were supplemented with 60 ng/ml lL-6 and 10 ng/ml IL-12 and incubated at 37 ""C. 
Cultures were restimulated as above on a weekly basis using DC generated and pulsed 
as above as antigen presenting cells, supplemented with 5 ng/ml IL-7 and 10 u/ml IL-2. 

30 Follow^ing 4 in vitro stimulation cycles, 96 lines (each line corresponding to one well) 
were tested for specific proliferation and cytokine production in response to the 
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stimulating pools with an irrelevant pool of peptides derived from manmiaglobin being 
used as a control. 

One line (referred to as 1-F9) was identified from pool #1 that 
demonstrated specific proliferation (measured by 3H proliferation assays) and cytokine 
5 production (measured by interferon-gamma ELISA assays) in response to pool #1 of 
P703P peptides. This line was ftirther tested for specific recognition of the peptide 
pool, specific recognition of individual peptides in the pool, and in HLA mismatch 
analyses to identify the relevant restricting allele. Line 1-F9 was found to specifically 
proliferate and produce interferon-gamma in response to peptide pool ^^1^ and also to 
10 peptide 4 (SEQ ID NO: 638). Pepfide 4 corresponds to amino acids 126-140 of SEQ ID 
NO: 327. Peptide titration experiments were conducted to assess the sensitivity of line 
1-F9 for the specific peptide. The line was found to specifically respond to peptide 4 at 
concentrations as low as 0.25 ng/ml, indicating that the T cells are very sensitive and 
therefore likely to have high affinity for the epitope. 
15 To determine the HLA restriction of the P703P response, a panel of 

antigen presenting cells (APC) was generated that was partially matched with the donor 
used to generate the T cells. The APC were pulsed with the peptide and used in 
proliferation and cytokine assays together with line 1-F9. APC matched with the donor 
at HLA-DRB0701 and HLA-DQB02 alleles were able to present the peptide to the T 
20 cells, indicating that the P703P-specific response is restricted to one of these alleles. 

Antibody blocking assays were utilized to determine if the restricting 
allele was HLA-DR0701 or HLA-DQ02. The anti-HLA-DR blocking antibody L243 
or an irrelevant isotype matched IgG2a were added to T cells and APC cultures 
pulsed with the pepfide RMPTVLQCVNVSVVS (SEQ ID NO: 638) at 250 ng/ml. 
25 Standard interferon-gamma and proliferation assays were performed. Whereas the 
control antibody had no effect on the ability of the T cells to recognize peptide-pulsed 
APC, in both assays the anti-HLA-DR antibody completely blocked the ability of the 
T cells to specifically recognize peptide-pulsed APC. 

To determine if the peptide epitope RMPTVLQCVNVSVVS (SEQ ID 
30 NO: 638) was naturally processed, the ability of line 1-F9 to recognize APC pulsed with 
recombinant P703P protein was examined. For these experiments a number of 
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recombinant P703P sources were utilized: E, co//-derived P703P, Pichia-dcrived P703P 
and baculovirus-derived P703P. Irrelevant protein controls used were E ro//-derived 
L3E a lung-specific antigen) and baculovirus-derived manunaglobin. In interferon- 
g^imma ELISA assays, line 1-F9 was able to efficiently recognize both£. coU fomis of 
5 P703P as well as Pichia-derived recombinant P703P, while baculovirus-derived P703P 
was recognized less efficiently. Subsequent Western blot analysis revealed that the E 
call and Pichia P703P protein preparations were intact while the baculovirus P703P 
preparation was approximately 75% degraded. Thus, peptide RA4PTVLQC\^VSVVS 
(SEQ ID NO: 638) ft-om P703P is a naturally processed peptide epitope derived fi^om 

1 0 P703P and presented to T cells in the context of HLA-DRB-0701 

In further studies, twenty -four 15-mer peptides overlapping by 10 amino 
acids and derived fi-oni the N-terminal fragment of P703P (corresponding to amino 
acids 27-154 of SEQ ID NO: 525) were generated by standard procedures and their 
ability to be recognized by CD4 cells was determined essentially as described above. 

15 DC were pulsed overnight with pools of the peptides with each peptide at a final 
concentration of 10 micrograin/ml. A large number of individual CD4 T cell lines 
(65/480) demonstrated significant proliferation and cytokine release (IFN-gamma) in 
response to the P703P peptide pools but not to a control peptide pool. The CD4 T cell 
iines which demonstrated specific activity were restimulated on the appropriate pool of 

10 P703P peptides and reassayed on the individual peptides of each pool as well as a 
peptide dose titration of the pool of peptides in a IFN-gamma release assay and in a 
proliferation assay. 

Sixteen immunogenic peptides were recognized by the T cells from the 
entire set of peptide antigens tested. The amino acid sequences of these peptides are 

:5 provided in SEQ ID NO: 656-671, with the conesponding cDNA sequences being 
provided in SEQ ID NO: 640-655. respectively. In some cases the peptide reactivity of 
the T cell line could be mapped to a single peptide, however some could be mapped to 
more than one peptide in each pool. Those CD4 T cell lines that displayed a 
representative pattern of recognition from each peptide pool wath a reasonable affinity 

0 tor peptide were chosen for further analysis (I-l A, -6A; II-4C, -5E; III-6E, 1V-4B, -3F, - 
-I OF, V-5B, -4D, and -lOF). These CD4 T cells lines were restimulated on the 
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appropriate individual peptide and reassayed on autologous DC pulsed with a truncated 
fomi of recombinant P703P protein made in E. coli (a.a. 96 - 254 of SEQ ID NO: 525), 
full-length P703P made in the baculovirus expression system, and a fusion between 
influenza virus NSl and P703P made in E. coli. Of the T cell lines tested, line I-IA 
5 recognized specifically the truncated form of P703P {E. coli) but no other recombinant 
form of P703P. This line also recognized the peptide used to elicit the T cells. Line 2- 
4C recognized the truncated form of P703P {E, coli) and the full length form of P703P 
made in baculovirus, as well as peptide. The remaining T cell lines tested were either 
peptide-spccific only (I1-5E, II-6F, IV-4B, IV-3F, IV-9B, IV-IOF, V-5B and V-4D) or 

10 were non-rcsponsi\e to any antigen tested (V-IOF). These results demonstrate that the 
peptide sequence RPLLANDLMLIKLDE (SEQ ID NO: 671; corresponding to a.a. 110- 
124 of SHQ ID NO: 525) recognized by the T cell line I-IA, and the peptide sequences 
SVSESD TIRSISIAS (SEQ ID NO: 668; corresponding to a.a. 125-139 of SEQ ID NO: 
525) and ISIASQCPTAGNSCL (SEQ ID NO: 667; con-esponding to a.a. 135-149 of 

15 SEQ ID NO: 525) recognized by the T cell line II-4C may be naturally processed 
epitopes of the P703P protein. 



EXAIvIPLE 1 1 
Expression Of A Breast Tumor-Derived Antigen 
20 In Prostate 



Isolation of the antigen B305D from breast tumor by differential display 
is described in US Patent Application No. 08/700,014, filed August 20, 1996. Several 
different splice forms of this antigen were isolated. The determined cDNA sequences 

25 for these splice forms are provided in SEQ ID NO: 366-375, with the predicted amino 
acid sequences corresponding to the sequences of SEQ ID NO: 292, 298 and 301-303 
being provided in SEQ ID NO: 299-306, respectively. In further studies, a splice 
variant of the cDNA sequence of SEQ ID NO: 366 was isolated which was found to 
contain an additional guanine residue at position 884 (SEQ ID NO: 530), leading to a 

30 frameshift in the open reading frame. The determined DNA sequence of this ORF is 
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proA'ided in SEQ ID NO: 531. This frameshift generates a protein sequence (provided 
in SEQ ID NO: 532 ) of 293 aniino acids that contains the C-temiinal domain common 
to the other isofomis of B305D hut that differs in the N-terminal region. 

The expression levels of B305D in a variety of tumor and nornial tissues 
5 were examined by real time PCR and by Northern analysis. Ihe results indicated that 
B305D is highly expressed in breast tumor, prostate tumor, normal prostate and nornial 
testes, with expression being low or undetectable in all other tissues examined (colon 
tumor, lung tumor ovar\' tumor, and normal bone marrow, colon, kidney, liver, lung, 
ovar>\ skin, small intestine, stomach). Using real-time PCR on a panel of prostate 
] 0 tumors, expression of B305D in prostate tumors was shown to increase with increasing 
Gleason grade, demonstrating that expression of B305D increases as prostate cancer 
progresses. 



EXAMPLE 12 

1 5 GENERATION Of Human Ctl In Vitro Using Whole Gene Priming And Stimulation 
Techniques With the Prostate-Specific Antigen P501 S . . . 



Using /;? vitro whole-gene priming with P501S-vaccinia infected DC 
(see, for example, Yee et ai. The Journal of Immunology, 157(9):4079-86, 1996), 

20 human CTL lines were derived that specifically recognize autologous fibroblasts 
transduced with P501S (also known as Ll-12), as determined by interferon-y ELISPOT 
analysis as described above. Using a panel of HLA-mismatched B-LCL lines 
transduced with P501S, these CTL lines were shown to be likely restricted to HLAB 
class I allele. Specifically, dendritic cells (DC) were differentiated from monocyte 

25 cultures derived from PBMC of normal human donors by growing for five days in 
RPMI medium containing 10% human serum, 50 ng/ml human GM-CSF and 30 ng/ml 
human IL-4. Following culture, DC w^ere infected overnight with recombinant P501S 
vaccinia virus at a multiplicity of infection (M.O.I) of five, and matured overnight by 
the addition of 3 pg/ml CD40 ligand. Virus w^as inactivated by UV irradiation. CD8+ 

30 T cells W'cre isolated using a magnetic bead system, and priming cultures were initiated 
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using standard culture tecliniques. Cultures were restimulated every 7-10 days using 
autologous primaiy fibroblasts retrovirally transduced with P501S and CD80. 
Following four stimulation cycles, CD8+ T cell lines were identified that specifically 
produced interferon-y when stimulated with P501S and CD80-transduced autologous 
5 fibroblasts. A panel of HLA-mismatched B-LCL lines transduced with P501S were 
generated to define the restriction allele of the response. By measuring interferon-y in 
an ELISPOT assay, the P501S specific response was shown to be likely restricted by 
HLA B alleles. These results demonstrate that a CD8-f CTL response to P501S can be 
elicited. 

10 To identify the epitope(s) recognized, cDNA encoding P501S was 

fragmented by various restriction digests, and sub-cloned into the retroviral expression 
vector pBIB-KS. Retroviral supematants were generated by transfecfion of the helper 
packaging line Phoenix- Ampho. Supematants were then used to transduce 
Jurkat/A2Kb cells for CTL screening. CTL were screened in IFN-gamina ELISPOT 

15 assays against these A2Kb targets transduced with the "library" of P50IS fragments. 
Initial positive fi-agments P501S/H3 and P50IS/F2 were sequenced and found to encode 
ammo acids 106-553 and amino acids 136-547, respectively, of SEQ ID NO: 113. A 
truncafion of H3 was made to encode amino acid residues 106-351 of SEQ ID NO: 113, 
which was unable to stimulate the CTL, thus localizing the epitope to amino acid 

20 residues 351-547. Additional fragments encoding amino acids 1-472 (Fragment A) and 
amino acids 1-351 (Fragment B) were also constructed. Fragment A but not Fragment 
B stimulated the CTL thus localizing the epitope to amino acid residues 351-472. 
Overlapping 20-mer and 1 8-mer peptides representing this region were tested by pulsing 
Jurkat/A2Kb cells versus CTL in an IFN-gamma assay. Only peptides P50 18-369(20) 

25 and P501S-369(18) stimulated the CTL. Nine-mer and 10-mer peptides representing 
this region were synthesized and similarly tested. Peptide P501S-370 (SEQ ID NO: 

539) was the minimal 9-mer giving a strong response. Peptide P501S-376 (SEQ ID NO: 

540) also gave a weak response, suggesting that it might represent a cross-reactive 
epitope. 
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In subsequent studies, the ability of priman' human B cells transduced 
with P501S to prime MHC class I-restricted, P501 S-specific. autologous CDS T cells 
was examined. Primary' B cells were derived from PBMC of a homoz>'gous HLA-A2 
donor by culture in CD4() ligand and lL-4, transduced at high frequency with 
5 recombinant P501S in the vector pBIB, and selected with blastocidin-S. For ui vitro 
priming- purified CD8+ T cells were cultured with autologous CD40 hgand + IL-4 
derived, P501S-transduced B cells in a 96-well microculture format. These CTL 
niicrocultures were re-stimulated with PSOlS-transduced B cells and then assayed for 
specificitv\ Following this initial screen, microcuhures with significant signal above 

10 background w^ere cloned on autologous EBV-transformed B cells (BLCL), also 
transduced with P501S. Using IFN-gamma ELISPOT for detection, several of these 
CDS T cell clones were found to be specific for P501S, as demonstrated by reactivity to 
BLCL,^501S but not BLCL transduced with control antigen. It was further 
demonstrated that the anti-P501S CDS T cell specificit)^ is HLA-A2-restricted. First, 

15 antibody blocking experiments with anti-HLA-A,B,C monoclonal antibody (W6.32), 

anti-HLA-B,C monoclonal antibody (BL23.2) and a control monoclonal antibody 

showed that only the anti-HLA-A,B,C antibody blocked recognition of P501S- 

expressing autologous BLCL. Secondly, the anti-P501S CTL also recognized an HLA- 

A2 matched, heterologous BLCL transduced with P501S, but not the corresponding 

20 EGFP transduced control BLCL. 

A naturally processed, CDS, class Lrestricted peptide epitope of P501S 

was identified as follows. Dendritic Cells (DC) were isolated by Percol gradient 

followed by differential adherence, and cultured for 5 days in the presence of RPMI 

medium containing 1% human serum, 50ng/ml GM-CSF and 30ng/ml IL-4. Following 

25 cultui^e, DC were infected for 24 hours with P501S-expressing adenovirus at an MOl of 
10 and matured for an additional 24 hours by the addition of 2ug/ml CD40 ligand. CDS 
cells were enriched for by the subtraction of CD4+, CD14+ and CD16+ populations 
from PBMC with magnetic beads. Priming cultures containing 10,000 P50IS- 
expressing DC and 1 00,000 CD8+ T cells per well were set up in 96-w^ell V-bottom 

>0 plates with RPMI containing 10% human serum, 5ng/ml IL-12 and lOng/ml IL-6. 
Cultures were stimulated every 7 days using autologous fibroblasts retrovirally 
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transduced to express P501S and CD80, and were treated with IFN-gamma for 48-72 
hours to upregulate MHC Class I expression. lOu/ml IL-2 was added at the time of 
stimulation and on days 2 and 5 following stimulation. Follov/ing 4 stimulation cycles, 
one P501S-specific CD8+ T cell line (refen^ed to as 2A2) was identified that produced 
5 IFN-gamma in response to I FN -gamma-treated P501S/CD80 expressing autologous 
fibroblasts, but not in response to IFN-gamma-treated P703P/CD80 expressing 
autologous fibroblasts in a y-IFN Elispot assay. Line 2A2 was cloned in 96- well plates 
with 0.5 cell/well or 2 cells/well in the presence of 75,000 PBMC/well, 10,000 B- 
LCL/well, 30ng/ml 0KT3 and 50u/ml IL-2. Twelve clones were isolated that showed 

1 0 strong P501 S specificity in response to transduced fibroblasts. 

Fluorescence activated cell sorting (FACS) analysis was performed on 
P501S-specific clones using CD3-, CD4- and CD8-specific antibodies conjugated to 
PercP, FITC and PE respectively. Consistent with the use of CD8 enriched T cells in 
the priming cultures, P5401S-specific clones were determined to be CD3+, CD8+ and 

15 CD4-- 

To identify the relevant P501S epitope recognized by P501S specific 
CTL, pools of 18-20 mer or 30-mer peptides that spanned the majority of the amino 
acid sequence of P50IS were loaded onto autologous B-LCL aiid tested in y-IFN Elispot 
assays for the ability to stimulate two PSOlS-specific CTL clones, referred to as 4E5 

20 and 4E7. One pool, composed of five 1 8-20 mer peptides that spanned amino acids 411- 
486 of P501S (SEQ ID NO: 1 13), was found to be recognized by both PSOlS-specific 
clones. To identify the specific 1 8-20 mer peptide recognized by the clones, each of the 
18-20 mer peptides that comprised the positive pool were tested individually in y-IFN 
Elispot assays for the ability to stimulate the two P501S-specific CTL clones, 4E5 and 

25 4E7. Both 4E5 and 4E7 specifically recognized one 20-mer peptide (SEQ ID NO: 710; 
cDNA sequence provided in SEQ ID NO: 711) that spanned amino acids 453-472 of 
P501 S. Since the minimal epitope recognized by CD8+ T cells is almost always either 
a 9 or 10-mer peptide sequence, 10-mer peptides that spanned the entire sequence of 
SEQ ID NO: 710 were synthesized that differed by 1 amino acid. Each of these 10-mer 

30 peptides was tested for the ability to stimulate two PSOlS-specific clones, (referred to as 
1D5 and 1E12). One 10-mer peptide (SEQ ID NO: 712; cDNA sequence provided in 
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SEQ ID NO: 713) was identified that specifically stimulated the P501 S-specitlc clones. 
This epitope spans amino acids 463-472 of P501S. This sequence defines a minimal 10- 
mer epitope fi-om P501S that can be naturally processed and to which CTL responses 
can be identified in normal PBMC. Thus, this epitope is a candidate for use as a vaccine 
5 moiety, and as a therapeutic and/or diagnostic reagent for prostate cancer. 

To identify the class I restriction element for the P501 S-derived sequence 
of SEQ ID NO: 712, HLA blocking and mismatch analyses were performed. In y-IFN 
Elispot assays, the specific response of clones 4A7 and 4E5 to P50IS-transduced 
autologous fibroblasts was blocked by pre-incubation with 25ug/ml W6/32 (pan-Class I 
10 blocking antibody) and Bl.23.2 (HLA-B/C blocking antibody). These results 
demonstrate that the SEQ ID NO: 712-specific response is restricted to an HLA-B or 
HLA-C allele. 

For the HLA mismatch analysis, autologous B-LCL (IILA- 
AEA23835L CwT, Cw7) and heterologous B-LCL (HLA- 

15 A2,A3318351,Cvv5,Cwl4) that share the HLAB51 allele were pulsed for one hour 
with 20ug/ml of peptide of SEQ ID NO: 712, washed, and tested in y-IFN Elispot assays 
for the ability to stimulate clones 4A7 and 4E5. Antibody blocking assays with the 
BL23.2 (HLA-B/C blocking anfibody) were also performed. SEQ ID NO: 712-specific 
response was detected using both the autologous (D326) and heterologous (D107) B- 

2(f LCL, and furthermore the responses were blocked by pre-incubation with 25ug/ml of 
Bl.23.2 HLA-B/C blocking antibody. Together these results demonstrate that the 
P501S-specific response to the peptide of SEQ ID NO: 712 is restricted to the HLA- 
B51 class I allele. Molecular cloning and sequence analysis of the HLA-B51 allele from 
D3326 revealed that the HLA-B51 subt>'pe of D326 is HLA-B5101 L 

25 Based on the 10-mer P501 S-derived epitope of SEQ ID NO: 712, two 9- 

mers with the sequences of SEQ ID NO: 714 and 715 were synthesized and tested in 
Elispot assays for the ability to stimulate tw^o P501S-specific CTL clones derived from 
line 2A2. The 10-mer peptide of SEQ ID NO: 712, as well as the 9-mer peptide of SEQ 
ID NO: 715, but not the 9-mer peptide of SEQ ID NO: 714. w^ere capable of stimulating 

30 the P501 S-specific CTL to produce IFN-gamma. These results demonstrate that the 
peptide of SEQ ID NO; 715 is a 9-mer P50] S-derived epitope recognized by P501S- 
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specific CTL. The DNA sequence encoding the epitope of SEQ ID NO: 715 is provided 
in SEQ ID NO: 716. 

To identify the class I restricting .allele for the P501S-derived peptide of 
SEQ ID NO; 712 and 715 specific response, each of the HLA B and C alleles were 
5 cloned from the donor used in the in vitro priming experiment. Sequence analysis 
indicated that the relevant alleles were HLA-B8, HLA-B51, HLA-CwOI and HLA- 
Cw07. Each of these alleles were subcloned into an expression vector and co- 
transfected together with the P501S gene into VA-13 cells. Transfected VA-13 cells 
were then tested for the ability to specifically stimulate the P501S-specific CTL in 

10 ELISPOT assays. VA-13 cells transfected with P501S and HLA-B51 were capable of 
stimulating the P501S-specific CTL to secrete gamma-IFN. VA-13 cells transfected 
with HLA-B51 alone or P501S + the other HLA-alleles were not capable of stimulating 
the P501S-specific CTL. These results demonstrate that the restricting allele for the 
P501S-specific response is the HLAB51 allele. Sequence analysis revealed that the 

15 subtype of the relevant restricting allele is HLA-B5101 1. 

To determine if the P501S-specific CTL could recognize prostate tumor 
cells that express P501S, the P501S-positive lines LnCAP and CRL2422 (both 
expressing "moderate" amounts of P501S mPv>>[A and protein), and PC-3 (expressing 
low amounts of P501S mRNA and protein), plus the P501S-negative cell line DU-145 

20 were retro virally transduced with the HLA-B51011 allele that was cloned from the 
donor used to generate the P501S-specific CTL. HLA-B51011- or EGFP-transduced 
and selected tumor cells were treated with gamma-interferon and androgen (to 
upregulate stimulatory functions and P501S, respectively) and used in gamma- 
interferon Elispot assays with the P501S-specific CTL clones 4E5 and 4E7. Untreated 

25 cells were used as a control. 

Both 4E5 and 4E7 efficiently and specifically recognized LnCAP and 
CRL2422 cells that were transduced with the HLA-B5101 1 allele, but not the same cell 
lines transduced with EGFP. Additionally, both CTL clones specifically recognized 
PC-3 cells transduced with HLA-B51011, but not the P501S-negative tumor cell line 

30 DU-145. Treatment with gamma-interferon or androgen did not enhance the ability of 
CTL to recognize tumor cells. These results demonstrate that PSOlS-specific CTL, 
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generated by in vitro whole gene priming, specifically and efficiently recognize prostate 
tumor cell lines that express P501 S. 

A naturally processed CD4 epitope of P501S was identified as follows. 
CD4 cells specific for P501S were prepared as described above. A series 
5 of 16 overlapping peptides were synthesized that spanned approximately 50% of the 
amino terminal portion of the P501S gene (amino acids 1- 325 of SEQ ID NO: 1 13). 
For priming, peptides were combined into pools of 4 peptides, pulsed at 4 pg/ml onto 
dendritic cells (DC) for 24 hours, with TNF-alpha. DC were then washed and mixed 
with negatively selected CD4+ T cells in 96 well U-bottom plates. Cultures were re- 

10 stimulated weekly on fresh DC loaded with peptide pools. Following a total of 4 
stimulation cycles, cells were rested for an additional week and tested for specificity to 
APC pulsed with peptide pools using y-IFN ELISA and proliferation assays. For these 
assays, adherent monocytes loaded with either the relevant peptide pool at 4ug/ml or an 
irrelevant peptide at f^ig/ml were used as APC. T cell lines that demonstrated either 

1 5 specific cytokine secretion or proliferation were then tested for recognition of individual 
peptides that were present in the pool. T cell lines could be identified from pools A and 
B that recognized individual peptides from these pools. 
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From pool A, lines AD9 and AEIO specifically recognized peptide 1 
(SEQ ID NO: 719), and line AF5 recognized peptide 39 (SEQ ID NO: 718). From pool 
B, line BC6 could be identified that recognized peptide 58 (SEQ ID NO: 717). Each of 
these lines were stimulated on the specific peptide and tested for specific recognition of 
5 the peptide in a titration assay as well as cell lysates generated by infection of HEK 293 
cells with adenovirus expressing either P501 S or an irrelevant antigen. For these assays, 
APC-adherent monocytes were pulsed with either 10, 1, or 0.1 |iig/ml individual P501S 
peptides, and DC were pulsed ovemight with a 1:5 dilution of adenovirally infected cell 
lysates. Lines AD9, AEIO and AF5 retained significant recognition of the relevant 

10 P501S-derived peptides even at 0.1 mg/ml. Furthermore, line AD9 demonstrated 
significant (8.1 fold stimulation index) specific activity for lysates from adenovirus- 
P501S infected cells. These results demonstrate that high affinity CD4 T cell lines can 
be generated toward P501S-derived epitopes, and that at least a subset of these T cells 
specific for the P501S derived sequence of SEQ ID NO: 719 are specific for an epitope 

15 that is naturally processed by human cells. The DNA sequences encoding the amino 
acid sequences of SEQ ID NO: 717-719 are provided in SEQ ID NO: 720-722, 
respectively. 

To further characterize the P501S-spccific activity^ of AD9, the line vvas 
cloned using anti-CD3. Three clones, referred to as lAl, 1A9 and IF5, were identified 

20 that were specific for the P501S-1 peptide (SEQ ID NO: 719). To determine the HLA 
restriction allele for the P501S-specific response, each of these clones was tested in 
class II antibody blocking and HLA mismatch assays using proliferation and gamma- 
interferon assays. In antibody blocking assays and measuring gamma-interferon 
production using ELISA assays, the ability of all three clones to recognize peptide 

25 pulsed APC was specifically blocked by co-incubation with either a pan-class II 
blocking antibody or a HLA-DR blocking antibody, but not with a HLA-DQ or an 
irrelevant antibody. Proliferation assays performed simultaneously with the same cells 
confirmed these results. These data indicate that the P501S-specific response of the 
clones is restricted by an HLA-DR allele. Further studies demonstrated that the 

30 restricting allele for the P501 S-specific response is HLA-DRB1501. 
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EXAMPLE 13 

IDENTIFJCATJON Of PROSTATE-SPHCJFJC ANTIGENS 

By Mjcroarray Analysis 



5 This Example describes the isolation of certain prostate-specific 

polypeptides from a prostate tumor cDNA library. 

A human prostate tumor cDNA expression librar\' as described above 
was screened using microarray analysis to identify clones that display at least a three 
fold over-expression in prostate tumor and/or normal prostate tissue, as compared to 

10 non-prostate normal tissues (not including testis). 372 clones were identified, and 319 
were successfully sequenced. Table I presents a summar\^ of tliese clones, which are 
shown in SEQ ID NOs:385-400. Of these sequences SEQ ID NOs:386, 389, 390 and 
392 conespond to novel genes, and SEQ ID NOs: 393 and 396 correspond to previously 
identified sequences. The others (SEQ ID NOs:385, 387, 388, 391, 394, 395 and 397- 

1 5 400) coiTCspond to IcnouTn sequences, as shown in Table I. 
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Table I 

Summary of Prostate Tumor Antigens 



ICnown rTf*ne<=i 


Prpviryii<^i V IrlfntifipH CTPriP^; 




T-cell gamma chain 


P504S 


23379 (SEQIDNO:389) 


Kailikrein 


PIOOOC 


23399 (SEQ 1D1SI0:392) 


Vector 


P501S 


23320 (SEQ ID NO:386) 


CGl-82 protem mKNA (23319; SEQ ID NO:385) 


P503S 


23381 (SEQ ID 
NO:390) 


PSA 


P510S 




AId.6Dehyd. 


P784P 




L-iditoI-2 dehydrogenase (23376; SEQ IDNO:3S8) 


P502S 




Ets transcription factor PDEF (22672; SEQ ID 
NO:398) 


P706P 




hTGR (22678; SEQ IDNO:399) 


1 9 1 42.2, bangur.seq (2262 1 ; 
SEQ ID NO:396) 




KIAA0295(22685; SEQ ID NO:400) 


5566.1 Wang (23404; 
SEQ ID NO:393) 




Prostatic Acid Phosphatase(22655; SEQ ID 
NO:397) 


P712P 




transglutaminase (2261 1 ; SEQ ID NO:395) 


P778P 




HDLBP (23508; SEQ lDNO:394) 






CGI-69 Protein(23367; SEQ ID NO:387) 






K1AA0122(23383; SEQ IDNO:391) 






TEEG 
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CGl-82 showed 4.06 fold over-expression in prostate tissues as 
compared to other nonnal tissues tested. It was over-expressed in 43% of prostate 
tumors. 25% normal prostate, not detected in other nomial tissues tested. L-iditol-2 
dehydrogenase showed 4.94 fold over-expression in prostate tissues as compared to 
5 other normal tissues tested. It was over-expressed in 90% of prostate tumors, 100% of 
normal prostate, and not detected in other normal tissues tested. Ets transcription factor 
PDEF showed 5.55 fold over-expression in prostate tissues as compared to other normal 
tissues tested. It was over-expressed in 47% prostate tumors, 25% normal prostate and 
not detected in other normal tissues tested. hTGRl showed 9.1 1 fold over-expression in 

10 prostate tissues as compared to other normal tissues tested. It was over-expressed in 
63% of prostate tumors and is not detected in normal tissues tested including normal 
prostate. KJAA0295 showed 5.59 fold over-expression in prostate tissues as compared 
to other noniial tissues tested. It was over-expressed in 47% of prostate tumors, low to 
undetectable in normal tissues tested including normal prostate tissues. Prostatic acid 

15 phosphatase showed 9.14 fold over-expression in prostate tissues as compared to other 
normal tissues tested. It was over-expressed in 67% of prostate tumors, 50% of normal 
prostate, and not detected in other normal tissues tested. Transglutaminase showed 
14.84 fold over-expression in prostate tissues as compared to other normal tissues 
tested. It was over-expressed in 30% of prostate tumors, 50% of normal prostate, and is 

20 not detected in other noraial tissues tested. High density lipoprotein binding protein 
(HDLBP) showed 28.06 fold over-expression in prostate tissues as compared to other 
normal tissues tested. It was over-expressed in 97% of prostate tumors, 75% of normal 
prostate, and is undetectable in all other normal tissues tested. CGI-69 showed 3.56 
fold over-expression in prostate tissues as compared to other nomial tissues tested. It is 

25 a low^ abundant gene, detected in more than 90% of prostate tumors, and in 75% nonnal 
prostate tissues. The expression of this gene in normal tissues w^as very low. 
KIAA0122 show^ed 4.24 fold over-expression in prostate tissues as compared to other 
noiTOal tissues tested. It was over-expressed in 57% of prostate tumors, it w^as 
undetectable in all normal tissues tested including nomial prostate tissues. 19142.2 

30 bangur show^ed 23.25 fold over-expression in prostate tissues as compared to other 
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normal tissues tested. It was over-expressed in 97% of prostate tumors and 100% of 
normal prostate. It was undetectable in other normal tissues tested. 5566. 1 Wang 
showed 3.31 fold over-expression in prostate tissues as compared to other normal 
tissues tested- It was over-expressed in 97% of prostate tumors, 75% normal prostate 
5 and was also over-expressed in normal bone marrow, pancreas, and activated PBMC. 
No\'eI clone 23379 (also referred to as P553S) showed 4.86 fold over-expression in 
prostate tissues as compared to other normal tissues tested. It was detectable in 97% of 
prostate tumors and 75% normal prostate and is undetectable in all other normal tissues 
tested. Nmcl clone 23399 showed 4.09 fold over-expression in prostate tissues as 

10 compared to other nomial tissues tested. It was over-expressed in 27% of prostate 
tumors and was undetectable in all normal tissues tested including normal prostate 
tissues. No\'eI clone 23320 showed 3.15 fold over-expression in prostate tissues as 
compared lo other normal tissues tested. It was detectable in all prostate tumors and 
50% of nonnal prostate tissues. It was also expressed in normal colon and trachea. 

15 Other nonnal tissues do not express this gene at high level. 

Subsequent full-length cloning studies on P553S, using standard 
techniques, revealed that this clone is an incomplete spliced form of P501S. The 
determined cDNA sequences for four splice variants of P553S are provided in SEQ ID 
NO: 623-626. An amino acid sequence encoded by SEQ ID NO: 626 is provided in 

20 SEQ ID NO: 627. The cDNA sequence of SEQ ID NO: 623 was found to contain two 
open reading frames (ORFs). The amino acid sequences encoded by these two ORFs 
are provided in SEQ ID NO: 628 and 629. 



EXAMPLE 14 

25 Identification Of Prostate-Specific Antigens 

By Electronic Subtraction 



This Example describes the use of an electronic subtraction technique to 
identify prostate-specific antigens. 
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Potential prostate-specific genes present in the GenBank human EST 
database were identified by electronic subtraction (similar to that described by 
Vasmatizis et al.. Proc. Natl Acad. Sci. USA 95:300-304, 1998). The sequences of EST 
clones (43,482) deri\ed from various prostate libraries were obtained from the GenBank 
5 public human EST database. Each prostate EST sequence was used as a quer> sequence 
in a BLASTN (National Center for Biotechnology Information) search against the 
human EST database. All matches considered identical (length of matching sequence 
>100 base pairs, densitv^ of identical matches over this region > 70%) were grouped 
(aligned) together in a cluster. Clusters containing more than 200 ESTs were discai'ded 
10 since they probably represented repetitive elements or highly expressed genes such as 
those for ribosomal proteins. If two or more clusters shared common ESTs, those 
clusters were grouped together into a ''superclusten'' resulting in 4,345 prostate 
superclusters. 

Records for the 479 human cDNA libraries represented in the GenBank 
1 5 release were downloaded to create a database of these cDNA library records. These 479 
cDNA libraries were grouped into three groups: Plus (normal prostate and prostate 
tumor libraries, and breast cell line libraries, in which expression was desired). Minus 
(libraries from other normal adult tissues, in which expression was not desirable), and 
Other (libraries from fetal tissue, infant tissue, tissues found only in women, non- 
20 prostate tumors and cell lines other than prostate cell lines, in which expression was 
considered to be irrelevant). A summary of these library groups is presented in Table IT 
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Table II 

Prostate cDNA Libraries and ESTs 



Library 


# of Libraries 


# of ESTs 


Plus 


25 


43,482 


Normal 


11 


18,875 


Tumor 


11 


21,769 


Cell lines 


3 


2,838 


Minus 


166 




Other 


287 





5 Each supercluster was analyzed in terms of the ESTs within the 

supercluster. The tissue source of each EST clone was noted and used to classify the 
superclusters into four groups: Type 1- EST clones found in the Plus group libraries 
only; no expression detected in Minus or Other group libraries; Type 2- EST clones 
derived from the Plus and Other group libraries only; no expression detected in the 

10 Minus group; Type 3- EST clones derived from the Plus, Minus and Other group 
libraries, but the number of ESTs derived from the Plus group is higher than in either 
the Minus or Other groups; and Type 4- EST clones derived from Plus, Minus and 
Other group libraries, but the number derived from the Plus group is higher than the 
number derived from the Minus group. This analysis identified 4,345 breast clusters 

15 (see Table III). From these clusters, 3,172 EST clones were ordered from Research 
Genetics, Inc., and were received as frozen glycerol stocks in 96-well plates. 
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Fable III 
Prostate Cluster Summan^ 



Type 


#of 
Superclusters 


ft of ESTs 
Ordered 


] 


688 


677 


2 


2899 


2484 


3 


85 


n 


4 


673 


0 


Total 


4345 


3172 



5 The EST clone inserts were PCR-amplified using amino-linked PCR 

primers for Synteni microarray analysis. When more than one PCR product was 
obtained for a particular clone, that PCR product was not used for expression analysis. 
In total, 2,528 clones from the electronic subtraction method were analyzed by 
microarray analysis to identify electronic subtraction breast clones that had high levels 

1 0 of tumor vs. normal tissue mRNA. Such screens were performed using a Synteni (Palo 
Alto, CA) microarray, according to the manufacturer's instructions (and essentially as 
described by Schena et ah, Froc, Natl. Acad Sci. USA PJ:10614-10619, 1996 and 
Heller et ah, Proc. Natl. Acad Sci. USA 5^:2150-2155, 1997). Within these analyses, 
the clones were arrayed on the chip, which was then probed with fluorescent probes 

15 generated from normal and tumor prostate cDNA, as well as various other normal 
tissues. The slides were scanned and the fluorescence intensity was measured. 

Clones with an expression ratio greater than 3 (i.e., the level in prostate 
tumor and normal prostate mRNA was at least three times the level in other normal 
tissue mRNA) were identified as prostate tumor-specific sequences (Table IV). The 

20 sequences of these clones are provided in SEQ ID NO: 401-453, with certain novel 
sequences shown in SEQ ID NO: 407, 413, 416-419, 422, 426, 427 and 450. 
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Table IV 
Prostate-tumor Specific Clones 





OCtJUClJC/C 


L^orniTJcriis 


H\J 1 


ZZ->4j 


previously identified PIOOOC 




OO ^ ^ 7 

j4 / 


previously identified P704P 


HUJ 


ZZ j4<S 


known 


Af\A 


OO C ^ A 

223j0 


known 


/I A C 


IZoj J 


T>C' A 

PSA 


4Uo 


ceo 
22332 


prostate secretory protein 94 


/I AT 

40 / 


oo ceo 

22553 


novel 




c c o 

22558 


previously identified P509S 


4Uy 


oo c /:r'> 

22562 


glandular kallikrein 


410 


22565 


previously identified PIOOOC 
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Further studies on the clone of SEQ ID NO: 407 (also referred to as 
P1020C) led to the isolation of an extended cDNA sequence provided in SEQ ID NO: 
591. This extended cDNA sequence was found to contain an open reading frame that 
5 encodes the predicted amino acid sequence of SEQ ID NO: 592. The P1020C cDNA 
and amino acid sequences were found to show some similarity to the human 
endogenous retroviral HERV-K pol gene and protein. 

EXAMPLE 15 

1 0 Further Identification Of Prostate-Specific Antigens By Microarray Analysis 

This Example describes the isolation of additional prostate-specific 
polypeptides from a prostate tumor cDNA Iibrar>\ 

A human prostate tumor cDNA expression library as described above 
15 was screened using microarray analysis to identify clones that display at least a three 
fold over-expression in prostate tumor and/or normal prostate tissue, as compared to 
non-prostate normal tissues (not including testis). 142 clones were identified and 
sequenced. Certain of these clones are shown in SEQ ID NO: 454-467. Of these 
sequences. SEQ ID NO: 459-460 represent novel genes. The others (SEQ ID NO: 454- 
20 458 and 461-467) correspond to known sequences. Comparison of the determined 
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cDNA sequence of SEQ ID NO: 461 with sequences in the Genbank database using the 
BLAST program revealed homology to the previously identified transmembrane 
protease serine 2 (TMPRSS2). The full-length cDNA sequence for this clone is 
provided in SEQ ID NO: 751, with the corresponding amino acid sequence being 
5 provided in SEQ ID NO: 752. The cDNA sequence encoding the first 209 amino acids 
of TMPRSS2 is provided in SEQ ID NO: 753, with the first 209 amino acids being 
provided in SEQ ID NO: 754. 

The sequence of SEQ ID NO: 462 (referred to as P835P) was found to 
correspond to the previously identified clone FLJ13518 (Accession AK023643; SEQ ID 

10 NO: 774), which had no associated open reading fi-ame (ORF). This clone was used to 
search the Geneseq DNA database and matched a clone previously identified as a G 
protein-coupled receptor protein (DNA Geneseq Accession A09351; amino acid 
Geneseq Accession Y92365), that is characterized by the presence of seven 
transmembrane domains. The sequences of fragments between these domains are 

15 provided in SEQ ID NO: 778^785, with SEQ ID NO: 778, 780, 782 and 784 
representing extracellular domains and SEQ ID NO: 779, 781, 783 and 785 representing 
intracellular domains. SEQ ID NO: 778-785 represent amino acids 1-28, 53-61, 83- 
103, 124-143, 165-201, 226-238, 263-272 and 297-381, respectively, of P835P. The 
fiill-length cDNA sequence for P835P is provided in SEQ ID NO: 773. The cDNA 

20 sequence of the open reading frame for P835P, including stop codon, is provided in 
SEQ ID NO: 775, with the open reading fi-ame without stop codon being provided in 
SEQ ID NO: 776 and the corresponding amino acid sequence being provided in SEQ ID 
NO: 777. 



25 EXAMPLE 16 

Further Characterization Of Prostate-Specific Antigen P710P 

This Example describes the full length cloning of P710P. 
The prostate cDNA library described above was screened with the P710P 
30 fragment described above. One million colonies were plated on LB/Ampicillin plates. 
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Nylon membrane filters were used to lift these colonies, and the cDNAs picked up by 
these filters were then denatured and cross-linked to the filters by UV light. The P710P 
fragment was radiolabeled and used to hybridize with the filters. Positive cDNA clones 
were selected and their cDNAs recovered and sequenced by an automatic Perkin 
5 Elmer/ Applied Biosystems Division Sequencer. Four sequences were obtained, and are 
presented in SEQ ID NO: 468-471, These sequences appear to represent different splice 
variants of the P7I0P gene. Subsequent comparison of the cDNA sequences of P710P 
with those in Genbank revealed homology to the DD3 gene (Genbank accession 
numbers AF103907 & AF103908). The cDNA sequence of DD3 is provided in SEQ ID 
10 NO: 618. 

EXAMPLE 17 
Protein Expressjon Of Prostate-Specjfjc Antigens 

15 This example describes the expression and purification of prostate- 

specific antigens in E. coli, baculovirus, mammalian and yeast cells. 

a) Expression of P5Q1S in E. coli 

Expression of the full-length form of P501S was attempted by first 
cloning P501S without the leader sequence (amino acids 36-553 of SEQ ID NO: 1 13) 

20 dovsTistream of the first 30 amino acids of the M. tuberculosis antigen Ral2 (SEQ ID 
NO: 484) in pET17b. Specifically, P501S DNA was used to perform PGR using the 
primers AW025 (SEQ ID NO: 485) and AW003 (SEQ ID NO: 486). AW025 is a sense 
cloning primer that contains a Hindlll site. AW003 is an antisense cloning primer that 
contains an EcoRI site. DNA amplification was perfoixned using 5 p.1 lOX Pfu buffer, 1 

25 ul 20 mM dNTPs, 1 ^1 each of the PGR primers at 1 0 fj.M concentration, 40 f.il water, 1 
1^1 Pfu DNA polymerase (Stratagene, La Jolla, CA) and 1 \\\ DNA at 100 ng/|_d- 
Denaturation at 95°G was performed for 30 sec, followed by 10 cycles of 95°G for 30 
sec, 60°C for 1 min and by 72°G for 3 min. 20 cycles of 95°C for 30 sec, 65°C for 1 min 
and by 72°C for 3 min, and lastly by 1 cycle of 72^C for 10 min. The PGR product was 
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cloned to Ral2m/pET17b using Hindlll and EcoRI. The sequence of the resultin*^ 
fusion construct (referred to as Ral2-P501S-F) was confirmed by DNA sequencing. 

The fusion construct was transformed into BL21(DE3)pLysE, pLysS and 
CodonPlus E, coli (Stratagene) and grown overnight in LB broth with kanamycin. The 
5 resulting culture was induced with IPTG. Protein was transferred to PVDF membrane 
and blocked with 5% non-fat milk (in PBS-Tween buffer), washed three times and 
incubated with mouse anti-His tag antibody (Clontech) for 1 hour. The membrane was 
washed 3 times and probed with HRP-Protein A (Zymed) for 30 min. Finally, the 
membrane was washed 3 times and developed with ECL (Amersham). No expression 

10 was detected by Western blot. Similarly, no expression was detected by Western blot 
when the Ral2-P501S-F fusion was used for expression in BL21 CodonPlus by CE6 
phage (Invitrogen). 

An N-terminal fragment of P501S (amino acids 36-325 of SEQ ID NO: 
113) was cloned down-stream of the first 30 amino acids of the M tuberculosis antigen 

15 Ral2 in pET17b as follows. P501S DNA was used to perform PGR using the primers 
AW025 (SEQ ID NO: 485) and AW027 (SEQ ID NO: 487). AW027 is an antisense 
cloning primer that contains an EcoRI site and a stop codon. DNA amplification was 
performed essentially as described above. The resulting PGR product was cloned to 
Ral2 in pET17b at the Hindlll and EcoRI sites. The fusion construct (referred to as 

20 Ral2-P501S-N) was confirmed by DNA sequencing. 

The Ral2-P501S-N fusion construct was used for expression in 
BL2](DE3)pLysE, pLysS and CodonPlus, essentially as described above. Using 
Western blot analysis, protein bands were observed at the expected molecular weight of 
36 kDa. Some high molecular weight bands were also observed, probably due to 

25 aggregation of the recombinant protein. No expression was detected by Western blot 
when the Ral2-P501S-F fusion was used for expression in BL2 1 CodonPlus by CE6 
phage. 

A fusion construct comprising a C-terminal portion of P501S (amino 
acids 257-553 of SEQ ID NO: 113) located down-stream of the fu^st 30 amino acids of 
30 the M tuberculosis antigen Ral2 (SEQ ID NO: 484) was prepared as follows. P501S 
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DNA was used to perform PCR using the primers A\V026 (SEQ ID NO: 488) and 
AW003 (SEQ ID NO: 486). AW026 is a sense cloning primer that contains a Hindlll 
site. DNA amplification was performed essentially as described above. I he resulting 
PCR product was cloned to Ral2 in pET17b at the Hindlll and EcoRI sites. The 
5 sequence for the fusion construct (referred to as Ral2-P501 S-C) was confinned. 

The Ral2-P501S-C fusion construct w^as used for expression in 
BL21(DE3)pLysE, pLysS and CodonPlus, as described above. A small ainount of 
protein w^as detected by Western blot, with some molecular weight aggregates also 
being observed. Expression was also detected by Western blot when the Ral2-P501S-C 

1 0 fusion was used for expression in BL2I CodonPlus induced by CE6 phage. 

A fusion construct comprising a fragment of P501S (amino acids 36-298 
of SEQ ID NO: 1 13) located down-stream of the M tuhercidosis antigen Ral2 (SEQ ID 
NO: 705) was prepared as follows. P501S DNA was used to perform PCR using the 
primers AW042 (SEQ ID NO: 706) and AW053 (SEQ ID NO: 707). AW042 is a sense 

15 cloning primer that contains a EcoRI site. AW053 is an antisense primer with stop and 
Xho I sites. DNA amplification was performed essentially as described above. The 
resulting PCR product was cloned to Ral2 in pET17b at the EcoRI and Xho I sites. The 
resulting fusion construct (referred to as Ral2-P501 S-E2) was expressed in B834 (DE3) 
pLys S E. coll host cells in TB media for 2 h at room temperature. Expressed protein 

20 was purified by washing the inclusion bodies and running on a Ni-NTA column. The 
purified protein stayed soluble in buffer containing 20 mM Tris-HCl (pH 8), 100 mM 
NaCl, 10 mM (5-Me and 5% glycerol. The determined cDNA and amino acid sequences 
for the expressed fusion protein are provided in SEQ ID NO: 708 and 709, respectfully. 



25 b) Expression of P501S in Baculovirus 

The Bac-to-Bac baculovirus expression system (BRL Life Technologies, 
hic.) was used to express P501S protein in insect cells. Full-length P501S (SEQ ID 
NO: 113) was amplified by PCR and cloned into the Xbal site of the donor plasmid 
j)I'astBacI. l^he recombinant bacmid and baculovirus were prepared according to the 
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manufacturer's instructions. The recombinant baculovirus was amplified in Sf9 cells 
and the high titer viral stocks were utilized to infect High Five cells (Invitrogen) to 
make the recombinant protein. The identity of the full-length protein was confirmed by 
N-temiinal sequencing of the recombinant protein and by Western blot analysis (Figure 
5 7). Specifically, 0.6 million High Five cells in 6-well plates were infected with either 
the unrelated control virus BV/ECD_PD (lane 2), with recombinant baculovirus for 
P501S at different amounts or MOIs (lanes 4-8), or were uninfected (lane 3). Cell 
lysates were run on SDS-PAGE under reducing conditions and analyzed by Western 
blot with the anti-P501S monoclonal antibody P501S-10E3-G4D3 (prepared as 

1 0 described below). Lane 1 is the biotinylated protein molecular weight marker (BioLabs). 

The localization of recombinant P501S in the insect cells was 
investigated as follows. The insect cells overexpressing P501S were fractionated into 
fractions of nucleus, mitochondria, membrane and cytosol. Equal amounts of protein 
from each fraction were analyzed by Western blot with a monoclonal antibody against 

15 P501S. Due to the scheme of fractionation, both nucleus and mitochondria fractions 
contain some plasma membrane components. However, the membrane fraction is 
basically free from mitochondria and nucleus. P501S was found to be present in all 
fractions that contain the membrane component, suggesting that P501S may be 
associated with plasma membrane of the insect cells expressing the recombinant 

20 protein. 

c) Expression of P5Q1 S in Mammalian Cells 

Full-length P501 S (553 amino acids; SEQ ID NO: 1 1 3) was cloned into 
various mammalian expression vectors, including pCEP4 (Invitrogen), pVR1012 
(Vical, San Diego, CA) and a modified form of the retroviral vector pBMN, referred to 
25 as pBIB. Transfection of P501S/pCEP4 and P501S/pVR1012 into HEK293 fibroblasts 
was carried out using the Eugene transfection reagent (Boehringer Mannheim). Briefly, 
2 ul of Fugene reagent was diluted into 100 ul of serum-free media and incubated at 
room temperature for 5-10 min. This mixture was added to 1 ug of P501S plasmid 
DNA, mixed briefly and incubated for 30 minutes at room temperature. The 
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I ugene DNA mixture was added to cells and incubated for 24-48 hours. Expression of 
recombuiant P501S in transfected HHK293 fibroblasts was detected by means of 
Western blot employing a monoclonal antibody to P501S. 

Transfection of p501S/pCEP4 into CHO-K cells (.American Type 
5 Culture Collection, Rockville, MD) w^as carried out using GenePorter transfection 
reagent (Gene Therapy Systems, San Diego, CA). Briefly, 15 pi of GenePorter was 
diluted in 500 pi of serum-free media and incubated at room temperature for 10 min. 
fhc GenePorter media mixture was added to 2 pg of plasmid DNA that w^as diluted in 
500 pi of serum-free media, mixed briefly and incubated for 30 min at room 

10 temperature. CHO-K cells were rinsed in PBS to remove serum proteins, and the 
(jene Porter A mix was added and incubated for 5 hours. The transfected cells were 
then fed an equal volume of 2x media and incubated for 24-48 hours. 

FACS analysis of P501S transiently infected CHO-K cells, demonstrated 
surface expression orP501S. Expression was detected using rabbit polyclonal antisera 

15 raised against a P501S peptide, as described below. Flow cytometric analysis was 
perfoirned using a FaCScan (Becton Dickinson), and the data were analyzed using the 
Cell Quest program. 

d} Expression of P501S in S, cerevisiae 

20 P501S was expressed in yeast, directed in membranes, using the yeast a 

prepro signal sequence. The natural signal sequence and first lumenal domain of P501S 
was deleted in order to conser\^e the natural positioning of the expressed P501S protein. 

Specifically, the a prepro signal sequence of S, cerevisiae linked to 
amino acids 55-553 of SEQ ID NO: 113 with a His tag tail was cloned into the plasmid 

25 pRJT15068 with the CUPl promoter and transfected into S. cerevisiae strain Y1790. 
The Y1790 strain is Leu+ and His-. Expression of protein w^as induced by addition of 
either 500 pM or 250 pM of CUSO4 at 30 ^C in minimal medium supplemented with 
liistidine. Cells were harvested 24 hours after induction. Extracts w^ere prepared by 
growling cells to a concentration of OD600 5.0 in 50 mlVf citrate phosphate buffer (pH 

30 4.0) plus 130 mM NaCl supplemented with protease inhibitors. Cells were disrupted 
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using glass beads and centrifuged for 20 min at 15,000 g. The recombinant protein was 

found to be 100% pellet associated. 

Expression of the recombinant protein (molecular weight 63 kD) was 

demonstrated by Western blot analysis, using the anti-P501S monoclonal antibody lOE- 
5 D4-G3 described below. The amino acid sequence of the expressed protein is provided 
in SEQ ID NO: 792. 

Fermentation processes for the production of the oc prepro-P501S-His tag 
recombinant protein in S. cerevisiae (strain Y1790 - CUPl inducible promoter) were 
evaluated as follows. One hundred jal of a master seed containing 2.5 x 10^ cells/ml of 

10 transformed S. cerevisiae Y1790 were spread on FSC004AA solid medium. The 
composition of the FSC004AA medium is as follows: glucose 10 g/1; Na2Mo04.2H20 
0.0002 g/l; folic acid 0.000064 g/l; KH2PO4 1 g/1; MnS04.H20 0.0004 g/1; Inositol 
0.064 g/1; MgS04.7H20 0.5 g/1; H3BO3 0.0005 g/1; Pyridoxine 0.008 g/1; CaCl2.2H20 
0.1 g/1; Kl 0.0001 g/1; Thiamine 0.008 g/1; NaCl 0.1 g/1; C0CI2.6H2O 0.00009 g/1; 

15 Niacin 0.000032 g/1; FeCb.eHsO 0.0002 g/1; Riboflavin 0.000016 g/1; Panthotenate Ca 
0.008 g/1; CUSO4.5H2O 0.00004 gA; Biotin 0.000064 g/1; para-aminobenzoic acid 
0.000016 g/1; ZnS04.7H20 0.0004 g/1; (NH4)2S04 5 g/1; agar 18 g/1; Histidine 0.1 gA. 

Two plates were incubated for 26 h at 30 These solid pre-cultures 
were harvested in 5 ml of liquid medium FSC007AA and 0.5 ml (or 9.3 x 10^ cells) of 

20 this suspension was used to inoculate 2 liquid pre-cultures. 

The composition of the FSC007AA medium is as follows: Glucose 10 
g/1; Na2Mo04.2H20 0.0002 g/1; folic acid 0.000064 g/1; KH2PO4 1 g/1; MnS04.H20 
0.0004 g/1; Inositol 0.064 g/1; MgS04.7H20 0.5 g/1; H3BO3 0.0005 g/1; Pyridoxine 0.008 
g/1; CaCl2.2H20 0.1 g/1; KI 0.0001 g/1; Thiamine 0.008 g/1; NaCl 0.1 g/I; C0CI2.6H2O 

25 0.00009 g/1; Niacine 0.000032 g/1; FeCl3.6H20 0.0002 g/I; Riboflavin 0.000016 g/1; 
Panthotenate Ca 0.008 g/1; CUSO4.5H2O 0.00004 g/1; Biotin 0.000064 g/1; para- 
aminobenzoic acid 0.000016 g/1; ZnS04.7H20 0.0004 g/1; (NH4)2S04 5 g/1; Histidine 
0.1 g/1. 

These pre-cultures were run for 20 hours in 2L flasks containing 400 ml 
30 of medium FSC0Q7AA in order to obtain an OP of 1.8. The other characteristics of 
these pre-cultures are as follows: pH 2.8: glucose 2.3 g/L: ethanol 3.4 g/L. 
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The best timing for liquid pre-cultures for strain Y1790 was determined 
in preliniinarv' experiments. Liquid pre-cultures containing 400 ml of medium and 
inoculated with various volumes of Master Seed (0.25. 0.5, 1 or 2 ml) were monitored 
in order to identify the best inoculum size and timing. Glucose, ethanoL pH, OD and 
5 cell number (detemiined by How cytometr\') were followed between 16 and 23 hours of 
culture. Glucose exhaustion and maximal biomass were obtained after 20 hour 
incubation with 0.5 inoculum. These conditions were adopted for transferring the pre- 
culture into fermentation. 

In totaL 800ml of pre-culture were used to inoculate a 20 L fermenter 

10 containing 5L of medium FSC002AA. Ihree ml of irradiated antifoam were added 
before inoculation. The composition of the FSC002AA medium is as follows: 
(NH4)2S04 6.4 g/1; Na2Mo04.2H20 2.05 mg/1; folic acid 0.54 mg/1; KH2PO4 8.25 g/1; 
MnS04-H2O 4.1 mg/1; inositol 540 mg/; MgS04.7H20 4.69 g/1; H3BO3 5.17 m/l; 
pyridoxine 68 mg/1; CaCl2-2H20 0.92 g/1; Kl 1.03 mg/1: thiamine 68 mg/1; NaCl 0.06g/l; 

15 C0CI2-6H2O 0.92 mg/1; Niacine 0.27 mg/1; HCl 1 ml/1; FeCh-eU.O 9,92 mg/I; 
Riboflavin 0.13 mg/1: CUSO4-5H2O 0.41 mg/1; Glucose 0.14 g/1; Panthotenate Ca 68 
mg/1; ZnS04.7H20 4.1 mg/1; Biotin 0.54 mg/1; para-aminobenzoic acid 0.13 mg/1; 
Histidine 0.3 g/1 

The carbon source (glucose) was supplemented by a continuous feeding 
20 of FFB004AA medium. The composition of the FFB004AA medium is as follows: 
glucose 350 g/1; Na2Mo04.2H20 5.15 mg/1; folic acid 1.36 mg/1; KH2PO4 20.6 g/1; 
MnS04.H20 10.3 mg/1; inositol 1350 mg/1; MgS04.7H20 11.7 g/1; H3BO3 12.9 m/l; 
pyridoxine 170 mg/1; CaCl2.2H20 2.35 g/1; KA 2.6 mg/1; thiamine 170 g/1; NaCl 0.15 g/1; 
C0CI2.6H2O 2.3 mg/l; niacine 0.67 mg/1; HCl 2.5 ml/1; FeCl3.6H20 24.8 mg/1; 
25 riboflavin; 0.33 mg/1; CUSO4.5H2O 1.03 mg/1; biotin 1.36 mg/I; panthotenate Ca 170 
mg/1; ZnSO4.7H20 10.3 mg/1; para-aminobenzoic acid: 0.33 mg/1; histidine 5.35 g/1. 

The residual glucose concentration was maintained very low (050 mg/L) 
in order to minimize ethanol production by fermentation. This was achieved by limiting 
the development of the microorganism using a limited glucose feed rate. The Standard 
30 biomass content (OD 80-90) was reached in fermentation after 44 hour growth phase. 

CUPl promoter was then induced by adding 500(iM CUSO4 in order to 
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produce P501 S antigen. CUSO4 addition was followed by ethanol accumulation (up to 6 
g/L), and the glucose feeding rate was then reduced in order to consume the ethanol. 
Ihe copper available for the microorganism was monitored by testing Cu ion 
concentration in the broth supernatant using a spectrophotometric copper assay (DETC 
5 method). The fermentation was then supplemented by CUSO4 tluoughout the induction 
phase in order to maintain its concentration between 150 and 250 pM in the 
supernatant. The biomass reached an OD of 100 at the end of induction. Cells were 
harv ested after 8 hours of induction. 

Cell homogenate was prepared and analysed by SDS-PAGE and Western 

10 Biol using standard protocols. A major protein band with the expected molecular weight 
of 62KD wa-s detected by Western blot using anti-P501S monoclonal antibodies. 
Western blot analysis also showed that the major 62KD band was progressively 
produced from 30 minutes of induction on, and reached a maximum after 3 hours. No 
more antigen seemed to be produced between 3 and 12 hours of induction. 

15 The number of passages through a French Press necessary to extract all 

the antigen from the cells was evaluated. One, three and five passages were tested and 
total cell lysales, supematants and pellets of cell lysates were analysed by Western blot. 
Tbj-ee passages tluough a French Press were sufficient to completely extract the antigen. 
The antigen was present in the insoluble fraction. 

20 



e) Expression of P703P in Baculovirus 

The cDNA for full-length P703P-DE5 (SEQ ID NO: 326), together with 
several flanking restriction sites, was obtained by digesting the plasmid pCDNA703 
with restriction endonucleases Xba I and Hind III. The resulting restriction fragment 

25 (approx. 800 base pairs) was ligated into the transfer plasmid pFastBacl which was 
digested with the same restriction enzymes. The sequence of the insert was confirmed 
by DNA sequencing. The recombinant transfer plasmid pFBP703 was used to make 
recombinant bacmid DNA and baculovirus using the Bac-To-Bac Baculovirus 
expression system {BRL Life Technologies). High Five cells were infected with the 

30 recombinant virus BVP703, as described above, to obtain recombinant P703P protein. 
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e) Expression of P788P in E . Coli 

A truncated, N-temiinal portion, of P788P (residues 1-644 of SEQ ID 
NO: 777; referred to as P788P-N) fused with a C-terminal 6xHis Tag was expressed in 
E coli as follow^s. P788P cDNA was amplified using the primers AW080 and AW081 
5 (SEQ ID NO: 672 and 673). AW080 is a sense cloning primer with an Ndel site. 
AW081 is an antisense cloning primer with a Xhol site. The PCR-amplified P788P, as 
well as the vector pCRXE were digested with Ndel and Xhol. Vector and insert were 
ligated and transformed into NovaBlue cells. Colonies w'ere randomly screened for 
insert and then sequenced. P788P-N clone #6 was confirmed to be identical to the 
10 designed construct. The expression construct P788P-N #6/pCRXl was transformed 
into E. coli BL21 CodonPlus-RlL competent cells. After induction, most of the cells 
grew welE achieving OD600 of greater than 2.0 after 3 hi\ Coomassie stained SDS- 
PAGE showed an over-expressed band at about 75 kD. Western blot analysis using a 
6xHisTag antibody confimied the band was P788P-N. The determined cDNA sequence 
\5 for P788P-N is provided in SEQ ID NO: 674, with the corresponding amino acid 
sequence being provided in SEQ ID NO: 675. 

Expression of P510S in E. Coli 

The P510S protein has 9 potenrial transmembrane domains and is 
predicted to be located at the plasma membrane. The C-terminal protein of tliis 
protein, as well as the predicted third extracellular domain of P510S were expressed in 
E. coli as follow'S. 

The expression construct referred to as Ral2-P501S-C was designed to 
have a 6 HisTag at the N-tenninal enc, followed by the M tuberculosis antigen Ral2 
(SEQ ID NO: 676) and then the C-terminal portion of P510S (amino residues 1176- 
1261 of SEQ ID NO: 538). Full-length P510S was used to amplify the P510S-C 
fragment by PCR using the primers AW056 and AW057 (SEQ ID NO: 677 and 678, 
respectively). AW056 is a sense cloning primer with an EcoRI site. AW057 is an 
antisense primer with stop and Xhol sites. The amplified P501S fi-agment and 
Ral2/pCRXl were digested with EcoRI and Xhol and then purified. The insert and 
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vector were ligated together and transformed into NovaBlue. Colonies were randomly 
screened for insert and sequences. For protein expression, the expression construct was 
transformed into E. coli BL21 (DE3) CodonPlus-RIL competent cells. A mini- 
induction screen was performed to optimize the expression conditions. After induction 
5 the cells grew well, achieving OD 600 nm greater than 2.0 after 3 hours. Coomassie 
stain SDS-PAGE showed a highly over-expressed band at approx. 30 kD. Though this 
is higher than the expected molecular weight, western blot analysis was positive, 
showing this band to be the His tag-containing protein. The optimized culture 
conditions are as follows. Dilute overnight culture/daytime culture (LB + kanamycin + 

1 0 chloramphenicol) into 2xYT (with kanamycin and chloramphenicol) at a ratio of 25 ml 
culture to 1 liter 2xYT. Allow to grow at 37 ""C until OD600 = 0.6. Take an aliquot out 
as TO sample. Add 1 niM IPTG and allow to grow at 30 °C for 3 hours. Take out a T3 
sample, spin down cells and store at -80 ^C. The determined cDNA and amino acid 
sequences for the Ral2-P510S-C construct are provided in SEQ ID NO: 679 and 682, 

15 respectively. 

The expression construct P510S-C was designed to have a 5' added start codon 
and a glycine (GGA) codon and then the P5 1 OS C terminal fragment followed by the in 
frame 6x histidine tag and stop codon from the pET28b vector. The cloning strategy is 
similar to that used for Ral2-P510S-C, except that the PGR primers employed were 

20 those shown in SEQ ID NO: 685 and 686, respectively and the Ncol/Xhol cut in 
pET28b was used. The primer of SEQ ID NO: 685 created a 5' Ncol site and added a 
start codon. The antisense primer of SEQ ID NO: 686 creates a Xhol site on P510S C 
terminal fragment. Clones were confirmed by sequencing. For protein expression, the 
expression construct was transformed into E, coli BL21 (DE3) CodonPlus-RIL 

25 competent cells. An OD600 of greater than 2.0 was obtained 30 hours after induction. 
Coomassie stained SDS-PAGE showed an over-expressed band at about 11 kD. 
Western blot analysis confirmed that the band was P510S-C, as did N-terminal protein 
sequencing. The optimized culture conditions are as follows: dilute overnight 
culture/daytime culture (LB + kanamycin + chloramphenicol) into 2x YT (+ kanamycin 

30 and chloramphenicol) at a ratio of 25 mL culture to 1 liter 2x YT, and allow to grow at 
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37 ''C until an OD 600 of about 0.5 is reached. Take out an aliquot as TO sample. Add 
1 mM IPTG and allow to grow at 30 for 3 hours. Spin down the cells and store at - 
80 "^C until purification. The determined cDNA and ainino acid sequences for the 
P510S'C construct are shown in SEQ ID NO: 680 and 683, respectively. 
5 The predicted third extracellular domain of P510S (P510S-E3; residues 

328-676 of SEQ ID NO: 538) was expressed in coli as follow s. The P510S fragment 
was amplified by PCR using the primers showTi in SEQ ID NO: 687 and 688. The 
primer of SEQ ID NO: 687 is a sense primer with an Ndel site for use in ligating into 
pPDM. The primer of SEQ ID NO: 688 is an antisense primer with an added Xhol site 

1 0 for use in ligating into pPDM. Tlie resulting fragment was cloned to pPDM at the Ndel 
and Xhol sites. Clones were confirmed by sequencing. For protein expression, the 
clone ws transformed into coli BL21 (DE3) CodonPlus-RIL competent cells. After 
induction, an OD600 of greater than 2.0 was achieved after 3 hours. Coomassie stained 
SDS-PAGE showed an over-expressed band at about 39 kD, and N-terminal sequencing 

15 confirmed the N-terminal to be that of P510S-E3. Optimized culture conditions are as 
follows: dilute overnight culture/daytime culture (LB + kanamycin + chloramphenicol) 
into 2x YT (kanamycin and chloramphenicol) at a ratio of 25 ml culture to 1 liter 2x 
YT. Allow to grow at 37 ""C until OD 600 equals 0.6. Take out an aliquot as TO 
sample. Add 1 mM IPTG and allow to grow at 30 for 3 hours. Take out a T3 

20 sample, spin down the cells and store at -80 until purification. The determined 
cDNA and amino acid sequences for the P501S-E3 construct are provided in SEQ ID 
NO: 681 and 684, respectively. 

r) Expression of P775S in E. Coli 

The antigen P775P contains multiple open reading frames (ORF). The 
25 third ORF, encoding the protein of SEQ ID NO: 483, has the best emotif score. An 
expression fusion construct containing the M tuberculosis antigen RaI2 (SEQ ID NO: 
676) and P775P-ORF3 with an N-terminal 6x HisTag was prepared as follows. P775P" 
ORF3 w^as amplified using the sense PCR primers of SEQ ID NO: 689 and the anti- 
sense PCR primer of SEQ ID NO: 690. Fhe PCR amplified fragment of P775P and 
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RaI2/pCRXl were digested with the restriction enzymes EcoRI and Xhol. Vector and 
insert were ligated and then transformed into NovaBlue cells. Colonies were randomly 
screened for insert and then sequenced. A clone having the desired sequence was 
transformed into £. coli BL21 (DE3) CodonPlus-RIL competent cells. Two hours after 
induction, the cell density peaked at OD600 of approximately 1.8. Coomassie stained 
SDS-PAGE showed an over-expressed band at about 31 kD. Westem blot using 6x 
HisTag antibody confirmed that the band was Ral2-P775P-ORF3. The determined 
cDNA and amino acid sequences for the fusion construct are provided in SEQ ID NO: 
691 and 692, respectively. 



H) Expression of a P703P His tag fusion protein in E. coli 

The cDNA for the coding region of P703P was prepared by PCR using 
the primers of SEQ ID NO; 693 and 694. The PCR product was digested with EcoRI 
restriction enzyme, gel purified and cloned into a modified pET28 vector with a His tag 
15 in frame, which had been digested with Eco72I and EcoRI restriction enzymes. The 
correct construct was confirmed by DNA sequence analysis and then transformed into 
E, coli BL21 (DE3) pLys S expression host cells. The determined amino acid and 
cDNA sequences for the expressed recombinant P703P are provided in SEQ ID NO: 
695 and 696, respectively. 

20 

I) Expression of a P705P His tag fusion protein in coli 

The cDNA for the coding region of P705P was prepared by PCR using 
the primers of SEQ ID NO: 697 and 698. The PCR product was digested with EcoRI 
restriction enzyme, gel purified and cloned into a modified pET28 vector with a His tag 
25 in frame, which had been digested with Eco72I and EcoRI restriction enzymes. The 
correct construct was confirmed by DNA sequence analysis and then transformed into 
coli BL2I (DE3) pLys S and BL21 (DE3) CodonPlus expression host cells. The 
determined amino acid and cDNA sequences for the expressed recombinant P705P are 
provided in SEQ ID NO: 699 and 700, respectively. 

30 
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J) Expression OF A P71 IP His tag fusion protein in E.cou 

The cDNA for the coding region of P71 IP was prepared by PCR using 
the primers of SEQ ID NO: 701 and 702. The PCR product was digested with EcoRJ 
restriction enzy me, gel purified and cloned into a modified pET28 vector with a His tag 
3 in frame, which had been digested with Eco72I and EcoRI restriction enzymes. The 
correct construct was confirmed by DNA sequence analysis and then transformed into 
E. coli BL21 (DE3) pLys S and BL21 (DE3) CodonPlus expression host cells. The 
determined amino acid and cDNA sequences for the expressed recombinant P71 IP are 
provided in SEQ ID NO: 703 and 704, respectively. 

10 

EXAMPLE 18 
Preparation And Characterization Of ANTmoDiEs 
Against Prostate-Specific Polypeptides 

IS 

a) Preparation and Characterization of Polyclonal Antibodies against P703P, 

P504S and P5Q9S 

Polyclonal antibodies against P703P, P504S and P509S were prepared as 

follows. 

20 Each prostate tumor antigen expressed in an E. coli recombinant 

expression system was grown overnight in LB broth with the appropriate antibiotics at 
37°C in a shaking incubator. The next moming, 10 ml of the overnight culture was 
added to 500 ml to 2x YT plus appropriate antibiotics in a 2L-baffled Erleruneyer flask. 
When the Optical Densit>'' (at 560 nm) of the culture reached 0.4-0.6, the cells were 

25 induced with IPTG (1 mM). Four hours after induction with IPTG, the cells were 
har\^ested by centrifugation. The cells were then washed with phosphate buffered saline 
and centrifuged again. The supernatant was discarded and the cells were either frozen 
for future use or immediately processed. 1 wenty ml of lysis buffer was added to the 
cell pellets and vortexed. To break open the E. coli cells, this mixture was then run 
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through the French Press at a pressure of 16,000 psi. The cells were then centrifuged 
again and the supernatant and pellet were checked by SDS-PAGE for the partitioning of 
the recombinant protein. For proteins that localized to the cell pellet, the pellet was 
resuspended in 10 niM Tris pH 8.0, 1% CHAPS and the inclusion body pellet was 
5 washed and centrifuged again. This procedure was repeated twice more, llie washed 
inclusion body pellet was solubilized with either 8 M urea or 6 M guanidine HCl 
containing 10 mM Tris pH 8.0 plus 10 mM imidazole. The solubilized protein was 
added to 5 ml of nickel-chelate resin (Qiagen) and incubated for 45 min to 1 hour at 
room temperature with continuous aghation. After incubation, the resin and protein 

10 mixture were poured through a disposable column and the flow through was collected. 
The column was then washed with 10-20 column volumes of the solubilization buffer. 
The antigen was then eluted from the column using 8M urea, 10 mM Tris pH 8.0 and 
300 mM imidazole and collected in 3 ml fractions. A SDS-PAGE gel was run to 
determine which fractions to pool for further purification. 

^5 As a final purification step, a strong anion exchange resin such as 

HiPrepQ (Biorad) was equilibrated with the appropriate buffer and the pooled fractions 
from above were loaded onto the column. Each antigen was eluted off the column with 
a increasing salt gradient. Fractions were collected as the column was run and another 
SDS-PAGE gel was run to determine which fractions from the column to pool. The 

20 pooled fractions were dialyzed against 10 mM Tris pH 8.0. The proteins were then 
vialed after filtration through a 0.22 micron filter and the antigens were frozen until 
needed for immunization. 

Four hundred micrograms of each prostate antigen was combined with 
100 micrograms of muramyldipeptide (MDP). Every four weeks rabbits were boosted 

25 with 100 micrograms mixed with an equal volume of Incomplete Freund's Adjuvant 
(IFA). Seven days following each boost, the animal was bled. Sera was generated by 
incubating the blood at 4''C for 12-4 hours followed by centrifugation. 

Ninety-six well plates were coated with antigen by incubating with 50 
microliters (typically 1 microgram) of recombinant protein at 4 ""C for 20 hours. 250 

30 microliters of BSA blocking buffer was added to the wells and incubated at room 
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temperature for 2 hours. Plates were washed 6 times with PBS 0.01% Tween. Rabbit 
sera was diluted in PBS. Fifts' microliters of diluted sera was added to each well and 
incubated at room temperature for 30 min. Plates were washed as described above 
before 50 microliters of goat anti-rabbit horse radish peroxidase (HRP) at a 1:10000 
> dilution was added and incubated at room temperature for 30 min. Plates were again 
washed as described above and 100 microliters of TMB microwell peroxidase substrate 
was added to each well. Following a 15 min incubation in the dark at room 
temperature, the colorimetric reaction was stopped with 100 microliters of IN H2SO4 
and read immediately at 450 nm. All polyclonal antibodies showed immunoreactivit>^ 
1 0 to the appropriate antigen. 

b) Preparation a nd Characterization of Antibodies against P5Q1 S 

A murine monoclonal antibody directed against the carboxy-terminus of 
the prostate-specific antigen P501S was prepared as follows. 

A truncated fragment of P501S (amino acids 355-526 of SEQ ID NO: 
15 113) was generated and cloned into the pET28b vector (Novagen) and expressed in E, 
coU as a thioredoxin fusion protein with a histidine tag. The ti^x-PSOlS fusion protein 
was purified by nickel chromatography, digested with thrombin to remove the trx 
fragment and further purified by an acid precipitation procediue followed by reverse 
phase HPLC. 

20 Mice were inununized with truncated P501 S protein. Serum bleeds from 

mice that potentially contained anti-P501S polyclonal sera were tested for P501S- 
specific reactivity using ELISA assays with purified P501S and trx-P501S proteins. 
Semm bleeds that appeared to react specifically with P501S were then screened for 
P501S reactivity by Western analysis. Mice that contained a P501S-specific antibody 

25 component w^ere sacrificed and spleen cells were used to generate anti-P501S antibody 
producing hybridomas using standard teclaniques. Hybridoma supematants w^ere tested 
for P501S-specific reactivity inifially by ELISA, aiid subsequently by FACS analysis of 
reactivity with P50 IS transduced cells. Based on these results, a monoclonal hybridoma 
referred to as 10E3 was chosen for further subcloning. A number of subclones were 
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generated, tested for specific reactivity to P501S using ELISA and typed for IgG 
isotype. The results of this analysis are shown below in Table V. Of the 16 subclones 
tested, the monoclonal antibody 10E3-G4-D3 was selected for further study. 

5 Table V 

Isotype analysis of murine anti-P501 S monoclonal antibodies 



Hybridoma clone 


Isotvne 


iji>lllllciicci [1^1 111 ^sUpci llaLoJll ^^l^l^/inij 


4D11 


IgGl 


14.6 


IGl 


IgGl 


0.6 


4F6 


IgGl 


72 


4H5 


IgGl 


13.8 


4H5-E12 


IgGl 


10.7 


4H5-EH2 


IgGl 


9.2 


4H5-H2-A10 


IgGl 


10 


4H5-H2-A3 


IgGl 


12.8 


4H5-H2-A10-G6 


IgGl 


13.6 


4H5-H2-B11 


IgGl 


12.3 


10E3 


IgG2a 


3.4 


10E3-D4 


IgG2a 


3.8 


10E3-D4-G3 


IgG2a 


9.5 


10E3-D4-G6 


IgG2a 


10.4 


10E3-E7 


IgG2a 


6.5 


8H12 


IgG2a 


0.6 



The specificity of 10E3-G4-D3 for P501S was examined by FACS 
10 analysis. Specifically, cells were fixed (2% formaldehyde, 10 minutes), permeabilized 
(0.1% saponin, 10 minutes) and stained with 10E3-G4-D3 at 0.5 - 1 |.tg/ml, followed by 
incubation with a secondary, FITC-conjugated goat anti-mouse Ig antibody 
(Pharmingen, San Diego, CA). Cells were then analyzed for FITC fluorescence using 
an Excalibur fluorescence activated cell sorter. For FACS analysis of transduced cells, 
15 B-LCL were retrovirally transduced with P501S. For analysis of infected cells, B-LCL 
were infected with a vaccinia vector that expresses P501S. To demonstrate specificity 
in these assays, B-LCL transduced with a different antigen (P703P) and uninfected B- 
LCL vectors were utilized. 10E3-G4-D3 was shown to bind with PSOlS-transduced B- 
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L.CL and also with P501 S-infected B-LCL, but not with either uninfected cells or 
P703P-transduced cells. 

l o determine whether the epitope recognized by 10E3-G4-D3 was found 
on ihe surface or in an intracellular compartment of cells. B-LCL were transduced with 
5 P501S or 11LA-B8 as a control antigen and either fixed and permeabilized as described 
abo\ e or directly stained with 10E3-G4-D3 and analyzed as above. Specific recognition 
of P50IS by 10E3-G4-D3 was found to require permeabilization, suggesting that the 
epitope.' reci>gni/ed by this antibody is intracellular. 

The reactivity of 10E3-G4-D3 with the tliree prostate tumor cell lines 
10 Lncap. PC-j and DU-145, which are known to express high, medium and very low 
levels oi P50IS, respectively, w^as examined by permeabilizing the cells and treating 
them as described above. Higher reactivity of 10E3-G4-D3 was seen with Lncap than 
with PC-3. which in turn showed higher reactivity that DU-145. These results are in 
agreement with the real time PGR and demonstrate that the antibody specifically 
15 recognizes [\>(>1S in these tumor cell lines and that the epitope recognized in prostate 
tumor cell lines is also intracellular. 

Specificity of 10E3-G4-D3 for P501S was also demonstrated by Western 
blot analysis. Ly sates from the prostate tumor cell lines Lncap, DU-145 and PC-3, from 
P501S-transiently transfected HEK293 cells, and fi-om non-transfected HEK293 cells 
20 were generated. Western blot analysis of these lysates with 10E3-G4-D3 revealed a 46 
kDa immunoreactive band in Lncap, PC-3 and P501S-transfected HEK cells, but not in 
DU-145 cells or non-transfected HEK293 cells. P501S mRNA expression is consistent 
with these results since semi-quantitative PGR analysis revealed that P501S mRNA is 
expressed in Lncap, to a lesser but detectable level in PC-3 and not at all in DU-145 
25 cells. Bacterially expressed and purified recombinant P50IS (referred to as P501SStr2) 
w^as recognized by 10E3-G4-D3 (24 kDa), as was full-length P501S that was transiently 
expressed in HEK293 cells using either the expression vector VRI012 or pCEP4. 
Although the predicted molecular weight of P501S is 60.5 kDa, both transfected and 
"native'" P50] S run at a slightly lower mobility due to its hydrophobic nature. 
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Inimunohistochemical analysis was performed on prostate tumor and a 
panel of normal tissue sections (prostate, adrenal, breast, cervix, colon, duodenum, gall 
bladder, ilevim, kidney, ovary, pancreas, parotid gland, skeletal muscle, spleen and 
testis). Tissue samples were fixed in formalin solution for 24 hours and embedded in 
5 paraffin before being sliced into 10 micron sections. Tissue sections were 
permeabilized and incubated with 10E3-G4-D3 antibody for 1 hr. HRP-labeled anti- 
mouse followed by incubation with DAB chromogen was used to visualize P501S 
iinmunoreactivity. P501S was found to be highly expressed in both normal prostate and 
prostate tumor tissue but was not detected in any of the other tissues tested. 

10 To identify the epitope recognized by 10E3-G4-D3, an epitope mapping 

approach was pursued. A series of 13 overlapping 20-21 mers (5 amino acid overlap; 
SEQ ID NO: 489-501) was synthesized that spanned the fragment of P501S used to 
generate 10E3-G4-D3. Flat bottom 96 well microtiter plates were coated with either the 
peptides or the P501S fragment used to immunize mice, at 1 microgram/ml for 2 hoxars 

15 at 37 ^C. Wells were then aspirated and blocked with phosphate buffered saline 
containing 1% (w/v) BSA for 2 hours at room temperature, and subsequently washed in 
PBS containing 0.1% Tween 20 (PBST). Purified antibody 10E3-G4-D3 was added at 
2 fold dilutions (1000 ng — 16 ng) in PBST and incubated for 30 minutes at room 
temperature. This was followed by washing 6 times with PBST and subsequently 

20 incubating with HRP-conjugated donkey anti-mouse IgG (m-L)Affinipure F(ab') 
fragment (Jackson Immunoreseai'ch, West Grove, PA) at 1:20000 for 30 minutes. 
Plates were then washed and incubated for 15 minutes in tetramethyl benzidine. 
Reactions were stopped by the addition of IN sulfuric acid and plates were read at 450 
nm using an ELISA plate reader. As shown in Fig. 8, reactivity Avas seen with the 

25 peptide of SEQ ID NO: 496 (corresponding to amino acids 439-459 of P501 S) and with 
the P501S fragment but not with the remaining peptides, demonstrating that the epitope 
recognized by 10E3-G4-D3 is localized to amino acids 439-459 of SEQ ID NO: 113. 

In order to further evaluate the tissue specificity of P501S, multi-array 
immunohistochemical analysis was performed on approximately 4700 different human 

30 tissues encompassing all the major normal organs as well as neoplasias derived from 
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these tissues. Sixty-five of these human tissue samples were of prostate origin. Tissue 
sections 0.6 mm in diameter were formalin-fixed and paraffin embedded. Samples were 
pretreated with HIER using 10 mM citrate buffer pH 6.0 and boiling for 10 min. 
Sections were stained with 10E3~G4-D3 and P501S immunoreactivity was visualized 
5 with HRP. All the 65 prostate tissues samples (5 normaf 55 untreated prostate tumors, 
5 hormone refractor}^ prostate tumors) were positive, showing distinct perinuclear 
staining. All other tissues examined were negative for P501 S expression. 

c) Preparation and Characterization of Antibodies against P 5Q3S 
0 A fragment of P503S (amino acids 113-241 of SEQ ID NO: 1 14) was 

expressed and purified from bacteria essentially as described above for P501S and used 
to immunize both rabbits and mice. Mouse monoclonal antibodies were isolated using 
standard hybridoma technology as described above. Rabbit monoclonal antibodies were 
isolated using Selected Lymphocyte Antibody Method (SLAM) technology at 
5 Immgenics Pharmaceuticals (Vancouver, BC, Canada). Table VI, below, lists the 
monoclonal antibodies that were developed against P503S. 



Table VI 



Antibody 


Species 


20D4 


Rabbit 


JAl 


Rabbit 


1A4 


Mouse 


1C3 


Mouse 


1C9 


Mouse 


1DI2 


Mouse 


2A11 


Mouse 


2H9 


Mouse 


4H7 


Mouse 


8A8 


Mouse 


8DI0 


Mouse 


9C12 


Mouse 


6D12 


Mouse 
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The DNA sequences encoding the complementarity determining regions 
(CDRs) for the rabbit monoclonal antibodies 20D4 and JAl were determined and are 

provided in SEQ ID NO: 502 and 503, respectively. 
5 In order to better define the epitope binding region of each of the 

antibodies, a series of overlapping peptides were generated that span amino acids 109- 
213 of SEQ ID NO: 114. These peptides were used to epitope map the anti-P503S 
monoclonal antibodies by ELISA as follows. The recombinant fragment of P503S that 
was employed as the immunogen was used as a positive control. Ninety-six well 

10 microtiter plates were coated with either peptide or recombinant antigen at 20 ng/well 
overnight at 4 ^C. Plates were aspirated and blocked with phosphate buffered saline 
containing 1% (w/v) BSA for 2 hours at room temperature then washed in PBS 
containing 0.1% Tween 20 (PBST). Purified rabbit monoclonal antibodies diluted in 
PBST were added to the wells and incubated for 30 min at room temperature. This was 

1 5 followed by washing 6 times with PBST and incubation with Protein-A HRP conjugate 
at a 1:2000 dilution for a further 30 min. Plates were washed six times in PBST and 
incubated with tetramethylbenzidine (TMB) substrate for a further 15 min. The reaction 
was stopped by the addition of IN sulfuric acid and plates were read at 450 nm using at 
ELISA plate reader. ELISA with the mouse monoclonal antibodies was performed with 

20 supematants fi-om tissue culture run neat in the assay. 

All of the antibodies bound to the recombinant P503S firagment, with the 
exception of the negative control SP2 supernatant. 20D4, JAl and 1D12 bound strictly 
to peptide #2101 (SEQ ID NO: 504), which corresponds to amino acids 151-169 of 
SEQ ID NO: 114. 1 C3 bound to peptide #21 02 (SEQ ID NO: 505), which corresponds 

25 to amino acids 165-184 of SEQ ID NO: 1 14. 9C12 bound to peptide #2099 (SEQ ID 
NO: 522), which corresponds to amino acids 120-139 of SEQ ID NO: 114. The other 
antibodies bind to regions that were not examined in these studies. 

Subsequent to epitope mapping, the antibodies were tested by FACS 
analysis on a cell line that stably expressed P503S to confirm that the antibodies bind to 

30 cell surface epitopes. Cells stably transfected with a control plasmid were employed as 
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a negative control. Cells were stained live with no fixative. 0.5 ug of anli-P503S 
monoclonal antibody was added and cells were incubated on ice for 30 min before being 
washed twice and incubated with a FITC-labelled goat anti-rabbit or mouse secondar>^ 
antibody for 20 min. After being washed twice, cells were analyzed with an Excalibur 

5 fluorescent activated cell sorter. The monoclonal antibodies 1C3, 1D12, 9C12, 20D4 
and JAl , but not 8D3, w^ere found to bind to a cell surface epitope of P503S. 

In order to determine which tissues express P503S, 
immunohistochemical analysis w^as performed, essentially as described above, on a 
panel of normal tissues (prostate, adrenal, breast, cervix, colon, duodenum, gall bladder, 

0 ileum, kidney, ovary, pancreas, parotid gland, skeletal muscle, spleen and testis). HRP- 
labeled anti-mouse or anti-rabbit antibody follow^ed by incubation with TMB was used 
to visualize P503S immunoreactivity. P503S was found to be highly expressed in 
prostate tissue, with lower levels of expression being observed in cervix, colon, ileum 
and kidney, and no expression being obser\^ed in adrenal, breast, duodenum, gall 

5 bladder, ovary, pajicreas, parotid gland, skeletal muscle, spleen and testis. 

Western blot analysis was used to chai-acterize anti-P503S monoclonal 
antibody specificity. SDS-PAGE was performed on recombinant (rec) P503S expressed 
in and purified from bacteria and on lysates from HEK293 cells transfected with full 
length P503S. Protein was transfeired to nitrocellulose and then Western blotted with 

0 each of the anti-P503S monoclonal antibodies (20D4, JAl, 1D12, 6D12 and 9C12) at 
an antibody concentration of 1 ug/ml. Protein was detected using horse radish 
peroxidase (HRP) conjugated to either a goat anti-mouse monoclonal antibody or to 
protein A-sepharose. The monoclonal antibody 20D4 detected the appropriate 
molecular weight 14 kDa recombinant P503S (amino acids 1 13-241) and the 23.5 kDa 

5 species in the HEK293 cell lysates transfected with full length P503S. Other anti- 
P503S monoclonal antibodies displayed similar specificity by Western blot. 

d) Preparation and Characterization of Antibodies against P703P 

Rabbits were immunized with either a truncated (P703PtrI; SEQ ID NO: 
172) or ftill-length mature form (P703Pfl; SEQ ID NO: 523) of recombinant P703P 
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protein was expressed in and purified from bacteria as described above. Affinity 
purified polyclonal antibody was generated using immunogen P703Pfl or P703Ptrl 
attached to a solid support. Rabbit monoclonal antibodies were isolated using SLAM 
teclinology at Immgenics Pharmaceuticals. Table VII below lists both the polyclonal 
5 and monoclonal antibodies that were generated against P703P. 



Table VII 



Antibody 


Immunogen 


Species/type 


Aff. Purif. P703P (truncated); #2594 


P703Ptrl 


Rabbit polyclonal 


Aff. Purif. P703P (full length); #9245 


P703Pfl 


Rabbit polyclonal 


2D4 


P703Ptrl 


Rabbit monoclonal 


8H2 


P703PtrI 


Rabbit monoclonal 


7H8 


P703Ptrl 


Rabbit monoclonal 



10 The DNA sequences encoding the complementarity determining regions 

(CDRs) for the rabbit monoclonal antibodies 8H2, 7H8 and 2D4 were determined and 
are provided in SEQ ID NO: 506-508, respectively. 

Epitope mapping studies were performed as described above. 
Monoclonal antibodies 2D4 and 7H8 were found to specifically bind to the peptides of 

1 5 SEQ ID NO: 509 (corresponding to amino acids 1 45- 1 59 of SEQ ID NO: 1 72) and SEQ 
ID NO: 510 (corresponding to amino acids 11-25 of SEQ ID NO: 172), respectively. 
The polyclonal antibody 2594 was found to bind to the peptides of SEQ ID NO: 511- 
514, with the polyclonal antibody 9427 binding to the peptides of SEQ ID NO: 515-517. 

The specificity of the anti-P703P antibodies was determined by Western 

20 blot analysis as follows. SDS-PAGE was performed on (1) bacterially expressed 
recombinant antigen; (2) ly sates of HEK293 cells and Ltk-/- cells either untransfected or 
transfected with a plasmid expressing full length P703P; and (3) supernatant isolated 
from these cell cultures. Protein was transferred to nitrocellulose and then Western 
blotted using the anti-P703P polyclonal antibody #2594 at an antibody concentration of 

25 1 ug/mL Protein was detected using horse radish peroxidase (HRP) conjugated to an 
anti-rabbit antibody. A 35 kDa immunoreactive band could be observed with 
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recombinant P703P. Recombinant P703P runs at a slightly higher molecular weight 
since it is epitope tagged. In lysates and supematants from cells transfected with full 
length P703P, a 30 kDa band corresponding to P703P was obser\'ed- To assure 
specificity, lysates from HEK293 cells stably transfected with a control plasmid were 
5 also tested and were negative for P703P expression. Other anti-P703P antibodies 
showed similar resuhs. 

Immunohistochemical studies were performed as described above, using 
anti-P703P monoclonal antibody. P703P was found to be expressed at high levels in 
normal prostate and prostate tumor tissue but was not detectable in all other tissues 
1 0 tested (breast tumor, lung tumor and normal kidney). 

e) Preparation and Characterization of Antibodies against P5Q4S 

Full-length P504S (SEQ ID IMO: 108) was expressed and purified from 
bacteria essentially as described above for P501S and employed to raise rabbit 
monoclonal antibodies using Selected Lymphocyte Antibody Method (SLAiM) 

15 technology at Immgenics Pharaiaceuticals (Vancouver, BC, Canada). The anti-P504S 
monoclonal antibody 13H4 w^as show^n by Western blot to bind to both expressed 
recombinant P504S and to native P504S in tumor cells. 

Immunohistochemical studies using 13H4 to assess P504S expression in 
various prostate tissues were performed as described above. A total of 104 cases, 

20 including 65 cases of radical prostatectomies with prostate cancer (PC), 26 cases of 
prostate biopsies and 13 cases of benign prostate hyperplasia (BPH), were stained with 
the anti-P504S monoclonal antibody 13H4. P504S shov/ed strongly cytoplasmic 
granular staining in 64/65 (98.5%) of PCs in prostatectomies and 26/26 (100% ) of PCs 
in prostatic biopsies. P504S was stained strongly and diffusely in carcinomas (4+ in 

25 91.2% of cases of PC; 3+ in 5.5%; 2+ in 2.2% and 1+ in 1.1%) and high grade prostatic 
intraepithelial neoplasia (4+ in all cases). The expression of P504S did not vary with 
Gleason score. Only 17/91 (18.7%) of cases of NP/BPH around PC and 2/13 (15.4%) of 
BPH cases were focally (l-f, no 2+ to 4+ in all cases) and w^eakly positive for P504S in 
large glands. Expression of P504S w^as not found in small atrophic glands, postatrophic 

30 hyperplasia, basal cell hyperplasia and transitional cell metaplasia in either biopsies or 
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prostatectomies. P504S was thus found to be over-expressed in all Gleason scores of 
prostate cancer (98.5 to 100% of sensitivity) and exhibited only focal positivities in 
large norma] glands in 19/104 of cases (82.3% of specificity). These findings indicate 
that P504S may be usefully employed for the diagnosis of prostate cancer. 

5 

EXAMPLE 19 
Characterization Of Cell Surface Expression And 
Chromosome Localization Of The Prostate-Specific Antigen P501 S 

10 This example describes studies demonstrating that the prostate-specific 

antigen P501S is expressed on the surface of cells, together with studies to determine 
the probable chromosomal location of P501 S. 

The protein P501S (SEQ ID NO: 113) is predicted to have 11 
transmembrane domains. Based on the discovery that the epitope recognized by the anti- 

15 P501S monoclonal antibody 10E3-G4-D3 (described above in Example 17) is 
intracellular, it was predicted that following transmembrane determinants would allow 
the prediction of extracellular domains of P501S. Fig. 9 is a schematic representation 
of the P501S protein showing the predicted location of the transmembrane domains and 
the intracellular epitope described in Example 17. Underlined sequence represents the 

20 predicted transmembrane domains, bold sequence represents the predicted exti'acellular 
domains, and italicized sequence represents the predicted intracellular domains. 
Sequence that is both bold and underlined represents sequence employed to generate 
polyclonal rabbit serum. The location of the transmembrane domains was predicted 
using HHMTOP as described by Tusnady and Simon (Principles Governing Amino 

25 Acid Composition of Integral Membrane Proteins: Applications to Topology Prediction, 
J, Mol Biol 255:489-506, 1998). 

Based on Fig. 9, the P501S domain flanked by the transmembrane 
domains corresponding to amino acids 274-295 and 323-342 is predicted to be 
extracellular. The peptide of SEQ ID NO: 518 corresponds to amino acids 306-320 of 

30 P501 S and lies in the predicted extracellular domain. The peptide of SEQ ID NO: 5 1 9, 



BNSDOCID <WO 0151633A2 I > 



wo (H/51633 



PC T/l SOI/(»1574 



195 

which is identical to the peptide of SEQ ID NO: 518 with the exception of the 
substiuition of the histidine w ith an asparginine. was synthesized as described above. A 
Cys-Gly was added to the C-temiinus of the peptide to facilitate conjugation to the 
carrier protein. Cleavage of the peptide from the solid support was carried out using the 
5 following cleavage mixture: trifluoroacetic acid:ethanediol:thioanisoI:w^ater:phenol 
(40:1:2:2:3). After cleaving for two hours, the peptide was precipitated in cold ether. 
The peptide pellet was then dissolved in lOVo v/v acetic acid and lyophilized prior to 
purification by CIS reverse phase hplc. A gradient of 5-60% acetonitrile (containing 
0.05?/o TFA) in water (containing 0.05% TFA) was used to elute the peptide. The purity 
10 of the peptide was verified by hplc and mass spectrometry, and was detemiined to be 
>95%. The purified peptide was used to generate rabbit polyclonal antisera as described 
above. 

Surface expression of P501S was examined by FACS analysis. Cells 
were stained with the polyclonal anti-P501S peptide serum at 10 pg/ml, washed, 

15 incubated with a secondar>^ FITC-conjugated goat anti-rabbit Ig antibody (ICN), w^ashed 
and analyzed for FITC fluorescence using an Excalibur fluorescence activated cell 
sorter. For FACS analysis of transduced cells, B-LCL were retrovirally transduced with 
I'SOIS. To demonstrate specificity in these assays, B-LCL transduced with an irrelevant 
antigen (P703P) or nontransduced were stained in parallel. For FACS analysis of 

20 prostate tumor cell lines, Lncap, PC-3 and DU-145 were utilized. Prostate tumor cell 
lines were dissociated from tissue culture plates using cell dissociation medium and 
stained as above. All samples were treated with propidium iodide (PI) prior to FACS 
analysis, and data was obtained from Pl-excluding (j\e., intact and non-permeabilized) 
cells. The rabbit polyclonal serum generated against the peptide of SEQ ID NO: 519 

25 was showTi to specifically recognize the surface of cells transduced to express P501S, 
demonstrating that the epitope recognized by the polyclonal serum is extracellular. 

To determine biochemically if P501S is expressed on the cell surface, 
peripheral membranes fi-om Lncap cells were isolated and subjected to Western blot 
analysis. Specifically, Lncap cells were lysed using a dounce homogenizer in 5 ml of 

30 homogenization buffer (250 mM sucrose, 10 mM HEPES, ImM EDTA, pH 8.0, 1 
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complete protease inhibitor tablet (Boehringer Mannheim)). Lysate samples were spun 
at 1000 g for 5 min at 4 ^C. The supernatant was then spun at 8000g for 1 0 min at 4 *^C. 
Supernatant from the SOOOg spin was recovered and subjected to a 100,000g spin for 30 
min at 4 '^C to recover peripheral membrane. Samples were then separated by SDS- 
5 PAGE and Western blotted with the mouse monoclonal antibody 10E3-G4-D3 
(described above in Example 17) using conditions described above. Recombinant 
purified P501S, as well as HEK293 cells transfected with and over-expressing P501S 
were included as positive controls for P501S detection. LCL cell lysate was included as 
a negative control. P501S could be detected in Lncap total cell lysate, the SOOOg 
10 (internal membrane) fraction and also in the 100,000g (plasma membrane) fraction. 
These results indicate that P501S is expressed at, and localizes to, the peripheral 
membrane. 

To demonstrate that the rabbit polyclonal antiserum generated to the 
peptide of SEQ ID NO; 519 specifically recognizes this peptide as well as the 

15 corresponding native peptide of SEQ ID NO: 518, ELISA analyses were performed. 
For these analyses, flat-bottomed 96 well microtiter plates were coated with either the 
peptide of SEQ ID NO: 519, the longer peptide of SEQ ID NO: 520 that spans the entire 
predicted extracellular domain, the peptide of SEQ ID NO: 521 which represents the 
epitope recognized by the P501S-specific antibody 10E3-G4-D3, or a P501S fragment 

20 (corresponding to amino acids 355-526 of SEQ ID NO: 113) that does not include the 
immunizing peptide sequence, at 1 |ag/ml for 2 hours at 37 °C. Wells were aspirated, 
blocked with phosphate buffered salme containing 1% (w/v) BSA for 2 hours at room 
temperature and subsequently washed in PBS containing 0.1% Tween 20 (PBST). 
Purified anti-P501S polyclonal rabbit serum was added at 2 fold dilutions (1000 ng - 

25 125 ng) in PBST and incubated for 30 min at room temperature. This was followed by 
washing 6 times with PBST and incubating with HRP-conjugated goat anti-rabbit IgG 
(H+L) Affinipure F(ab') fragment at 1:20000 for 30 min. Plates were then washed and 
incubated for 1 5 min in tetramethyl benzidine. Reactions were stopped by the addition 
of IN sulfuric acid and plates were read at 450 nm using an ELISA plate reader. As 

30 shown in Fig. 11, the anti-P501S polyclonal rabbit serum specifically recognized the 



BNSDOCID <WO 



0151633A2 t > 



wo 01/51633 



PC T/l SOI/01574 



197 

peptide of SEQ ID NO: 519 used in the immunization as well as the longer peptide of 
SEQ ID NO: 520, but did not recognize the irrelevant P501 S-derived peptides and 
fragments. 

In further studies, rabbits were immunized with peptides derived from 
5 the P50IS sequence and predicted to be either extracellular or intracellular, as shown in 
Fig. 9. Polyclonal rabbit sera were isolated and polyclonal antibodies in the serum were 
purified, as described above. To detennine specific reactivity with P501S, FACS 
analysis was employed, utilizing either B-LCL transduced with P501S or the irrelevant 
antigen P703P, of B-LCL infected with vaccinia virus-expressing P501S. For surface 

10 expression, dead and non-intact cells were excluded fi:om the analysis as described 
above. For intracellular staining, cells were fixed and permeabilized as described 
above. Rabbit polyclonal senim generated against the peptide of SEQ ID NO: 548, 
which coiTesponds to amino acids 181-198 of P501S, was found to recognize a surface 
epitope of P501S. Rabbit polyclonal serum generated against the peptide SEQ ID NO: 

15 551, which corresponds to amino acids 543-553 of P501S, was found to recognize an 
epitope that was either potentially extracellular or intracellular since in different 
experiments intact or permeabilized cells were recognized by the polyclonal sera. 
Based on similar deductive reasoning, the sequences of SEQ ID NO: 541-547, 549 and 
550, which correspond to amino acids 109-122, 539-553, 509-520, 37-54, 342-359, 

20 295-323, 217-274, 143-160 and 75-88, respectively, of P501S, can be considered to be 
potential surface epitopes of P501S recognized by antibodies. 

In further studies, mouse monoclonal antibodies were raised against 
amino acids 296 to 322 to P501S, which are predicted to be in an extracellular domain. 
A/J mice w^ere immunized with P501S/adenovirus, followed by subsequent boosts with 

25 an E. coli recombinant protein, referred to as P501N, that contains amino acids 296 to 
322 of P501S, and with pepfide 296-322 (SEQ ID NO: 755) coupled with KLH. The 
mice were subsequently used for splenic B cell fusions to generate anti-peptide 
hybridomas. The resulting 3 clones, referred to as 4F4 (IgGl, kappa), 4G5 
(IgG2a,kappa) and 9B9 (IgGLkappa), were grown for antibody production. The 4G5 

-K) mAb was purified by passing the supernatant over a Protein A-sepharose column, 
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followed by antibody elution using 0.2M glycine, pH 2.3. Purified antibody was 
neutralized by the addition of IM Tris, pH 8, and buffer exchanged into PBS. 

For ELISA analysis, 96 well plates were coated with P501S peptide 296- 
322 (referred to as P501-long), an irrelevant P775 peptide, P501S-N, P501TR2, P501S- 
5 long-KLH, P501S peptide 306-319 (referred to as P501-short)-KLH, or the irrelevant 
peptide 2073-KLH, all at a concentration of 2 ug/ml and allowed to incubate for 60 
minutes at 37 "^C. After coating, plates were washed 5X with PBS + 0.1% Tween and 
then blocked with PBS, 0.5% BSA, 0.4% Tween20 for 2 hours at room temperature. 
Following the addition of supematants or purified mAb, the plates were incubated for 

1 0 60 minutes at room temperature. Plates were washed as above and donkey anti-mouse 
IgHRP-linked secondaiy antibody was added and incubated for 30 minutes at room 
temperature, followed by a final washing as above. TMB peroxidase substrate was 
added and incubated 15 minutes at room temperature in the dark. The reaction was 
stopped by the addition of IN H2SO4 and the OD was read at 450 nM. All three hybrid 

15 clones secreted mAb that recognized pepfide 296-322 and the recombinant protein 
P501N. 

For FACS analysis, HEK293 cells were transiently transfected with a 
P501SA^R1012 expression constructs using Fugene 6 reagent. After 2 days of culture, 
cells were harvested and washed, then incubated with purified 4G5 mAb for 30 minutes 

20 on ice. After several washes in PBS, 0.5% BSA, 0.01% azide, goat anti-mouse Ig-FITC 
was added to the cells and incubated for 30 minutes on ice. Cells were washed and 
resuspended in wash buffer including 1% propidium iodide and subjected to FACS 
analysis. The FACS analysis confirmed that amino acids 296-322 of P501S are in an 
extracellular domain and are cell surface expressed. 

25 The chromosomal location of P501S was determined using the 

GeneBridge 4 Radiafion Hybrid panel (Research Genefics). The PCR primers of SEQ 
ID NO: 528 and 529 were employed in PCR with DNA pools from the hybrid panel 
according to the manufacturer's directions. After 38 cycles of amplification, the 
reaction products were separated on a 1 .2% agarose gel, and the results were analyzed 

30 through the Whitehead Institute/MIT Center for Genome Research web server 
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(http: A\^\A\-genc>nieAvi.mit.eduyCgi-biri/contig/r^ to detennine the probable 

cliromosomal location. Using this approach, P501S was mapped to the long ann of 
chromosome 1 at WI-9641 between q32 and q42. This region of chromosome 1 has 
been linked to prostate cancer susceptibility in hereditary prostate cancer (Smith al. 
Science 27-/: 1371-1374, 1996 and Berthon ef al Anr J. Hum. Genet. 62:1416-1424, 
1998). 'Hiese results suggest that P501S may play a role in prostate cancer malignancy. 

EXAMPLE 20 

RrOUI^ATlON OF EXPRESSIOTM OF THE PROSTATE-SpECIFIC ANTIGEN P50 1 S 

Steroid (androgen) hormone modulation is a common treatment modality 
in prostate c:uicer. The expression of a number of prostate tissue-specific antigens have 
previously been demonstrated to respond to androgen. The responsiveness of the 
prostate-specific antigen P501S to androgen treatment was examined in a tissue culture 
system as follows. 

Cells from the prostate tumor cell line LNCaP were plated at 1.5 x 10^ 
cells/T75 llask (for RNA isolation) or 3 x 10^ cells/well of a 6-well plate (for FACS 
analysis) and grown overnight in RPMI 1640 media containing 10% charcoal-stripped 
fetal calf scrum (BRL Life Technologies, Gaithersburg, MD). Cell culture was 
continued for an additional 72 hours in RPMI 1640 media containing 10% charcoal- 
stripped fetal calf serum, with 1 nM of the synthetic androgen Methyltrienolone 
(Rl 881 ; New England Nuclear) added at various time points. Cells were then harvested 
for RNA isolation and FACS analysis at 0, 1, 2, 4, 8, 16, 24, 28 and 72-hours post 
androgen addition. FACS analysis w^as performed using the anti-P501S antibody 10E3- 
G4-D3 and pemieabilized cells. 

For Northern analysis, 5-10 micrograms of total RNA was run on a 
formaldehyde denaturing gel, transferred to Hybond-N nylon membrane (Amersham 
Pharmacia Biotech. Piscataway, NJ), cross-linked and stained with methylene blue. The 
filter was then prehybridized with Church's Buffer (250 mM Na2HP04, 70 mM H3PO4, 
I mM EDTA. 1% SDS, 1% BSA in pH 7.2) at 65 ""C for 1 hour. P501S DNA was 
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labeled with 32P using High Prime random-primed DNA labeling kit (Boehringer 
Mannheim). Unincorporated label was removed using MicroSpin S300-HR columns 
(Amersham Pharmacia Biotech). The RNA filter was then hybridized with fresh 
Church's Buffer containing labeled cDNA overnight, washed with IX SCP (0.1 M 
5 NaCl, 0.03 M Na2HP04-7H20, 0.001 M Na2EDTA), 1% sarkosyl (n-lauroylsarcosine) 
and exposed to X-ray film. 

Using both FACS and Northern analysis, P501S message and protein 
levels were found in increase in response to androgen treatment. 

10 EXAMPLE 20 

PREPARATION] Of Fusion Proteins Of Prostate-Specific Antigens 

The example describes the preparation of a fusion protein of the prostate- 
specific antigen P703P and a truncated form of the known prostate antigen PSA. The 
15 truncated form of PSA has a 21 amino acid deletion around the active serine site. The 
expression construct for the fiision protein also has a restriction site at 3' end, 
immediately prior to the termination codon, to aid in adding cDNA for additional 
antigens. 

The full-length cDNA for PSA was obtained by RT-PCR from a pool of 
20 RNA from human prostate tumor tissues using the primers of SEQ ID NO: 607 and 608, 
and cloned in the vector pCR-Blunt II-TOPO. The resulting cDNA was employed as a 
template to make two different fragments of PSA by PGR with two sets of prhners 
(SEQ ID NO: 609 and 610; and SEQ ID NO: 611 and 612). The PGR products having 
the expected size were used as templates to make truncated forms of PSA by PGR with 
25 the primers of SEQ ID NO: 61 1 and 613, which generated PSA (delta 208-218 in amino 
acids). The cDNA for the mature form of P703P with a 6X histidine tag at the 5' end, 
was prepared by PGR with P703P and the primers of SEQ ID NO: 614 and 615. The 
cDNA for the fiision of P703P with the truncated form of PSA (referred to as FOPP) 
was then obtained by PGR using the modified P703P cDNA and the truncated form of 
30 PSA cDNA as templates and the primers of SEQ ID NO: 614 and 615. The FOPP 
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cDNA was cloned into the Ndel site and Xhol site of the expression vector pCRXl, and 
confirmed by DNA sequencing. The detennined cDNA sequence for the fusion 
construct FOPP is provided in SEQ ID NO: 616. with the amino acid sequence being 
provided in SHQ ID NO: 617. 
5 The fusion FOPP was expressed as a single recombinant protein in E. 

coli as follows. The expression plasmid pCRXlFOPP w^as transformed into the E. coli 
strain BL21-CodonPlus RIL. The transformant was show'n to express FOPP protein 
upon induction with 1 mM IPTG. The culture of the corresponding expression clone 
was inoculated into 25 ml LB broth containing 50 ug/ml kanamycin and 34 ug/ml 

10 chloramphenicol, grown at 37 to OD600 of about 1, and stored at 4 overnight. 
The culture w^as diluted into 1 liter of TB LB containing 50 ug/ml kanamycin and 34 
ug/ml chloramphenicol, and grown at 37 to OD600 of 0.4. IPTG was added to a 
llnal concentration of 1 mM, and the culture was incubated at 30 for 3 hours. The 
cells w^ere pelleted by centrifugation at 5,000 RPM for 8 min. To purify the protein, the 

15 cell pellet was suspended in 25 ml of 10 mM Tris-Cl pH 8.0, 2mM PMSF, complete 
protease inhibitor and 15 ug lysozyme. The cells were lysed at 4 °C for 30 minutes, 
sonicated several times and the lysate centrifliged for 30 minutes at 10,000 x g. The 
precipitate, which contained the inclusion body, was washed twice with 10 mM Tris-Cl 
pH 8.0 and 1% CHAPS. The inclusion body was dissolved in 40 ml of 10 mM Tris-Cl 

20 pH 8.0, 100 mM sodium phosphate and 8 M urea. The solution was bound to 8 ml Ni- 
NTA (Qiagen) for one hour at room temperature. The mixture w^as poured into a 25 ml 
column and washed with 50 ml of 10 mM Tris-Cl pH 6.3, 100 mM sodium phosphate, 
0.5% DOC and 8M urea. The bound protein w^as eluted w^ith 350 mM imidazole, 10 
mM Tris-Cl pH 8.0, 100 mM sodium phosphate and 8 M urea. The fractions containing 

25 FOPP proteins were combined and dialyzed extensively against 10 mM Tris-Cl pH 4.6, 
aliquoted and stored at - 70 ^C. 
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EXAMPLE 21 

Real-Time PGR Characterization of the Prostate-Specific Antigem P501S in 
Peripheral Blood of Prostate Cancer Patients 

5 Circulating epithelial cells were isolated from fresh blood of normal 

individuals and metastatic prostate cancer patients, mRNA isolated and cDNA prepared 
using real-time PCR procedures. Real-time PGR was performed with the Taqman™ 
procedure using both gene specific primers and probes to determine the levels of gene 
expression. 

10 Hpiihclial cells were enriched from blood samples using an 

immunomagnelic bead separation method (Dynal A.S., Oslo, Norway). Isolated cells 
were lysed and the magnetic beads removed. The lysate was then processed for poly A+ 
mRNA isolation using magnetic beads coated with 01igo(dT)25. After washing the 
beads in buffer, bead/poly A+ RNA samples were suspended in 10 mM Tris HCl pH 8.0 

15 and subjected to reversed transcription. The resulting cDNA was subjected to real-time 
PCR using gene specific primers. Beta-actin content was also determined and used for 
normalization. Samples with P501S copies greater than the mean of the normal 
samples + 3 standard deviations were considered positive. Real time PCR on blood 
samples was performed using the Taqman™ procedure but extending to 50 cycles using 

20 forward and reverse primers and probes specific for P501S. Of the eight samples tested, 
6 were positive for P501S and P-actin signal. The remaining 2 samples had no 
detectable p-actin or P501S. No P501S signal was observed in the four normal blood 
samples tested. 

25 EXAMPLE 22 

Expression of the Prostate-Specific Antigens P703P and P501 S in 
SCID Mouse-Passaged Prostate Tumors 

When considering the effectiveness of antigens in the treatment of 
30 prostate cancer, the continued presence of the antigens in tumors during androgen 
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ablation therapy is important. The presence of the prostate-specific antigens P703P and 
P501S in prostate tumor samples grown in SCID mice in the presence of testosterone 
was evaluated as follows. 

Two prostate tumors that had metastasized to the bone were removed 

5 from patients, implanted into SCID mice and grown in the presence of testosterone. 
Tumors were evaluated for niRNA expression of P703P, P501S and PSA using 
quantitative real time PCR with the SYBR green assay method. Expression of P703P 
and P501S in a prostate tumor was used as a positive control and the absence in normal 
intestine and normal heart as negative controls. In both cases, the specific mRNA was 

0 present in late passage tumors. Since the bone metastases were grown in the presence 
of testosterone, this implies that the presence of these genes would not be lost during 
androgen ablation therapy. 

EXAMPLE 23 

5 ANT1-P503S MOTvJOCLONAL ANTIBODY INHIBITS TUMOR GROWTH IN ViVO 

The ability of the anti-P503S monoclonal antibody 20D4 to suppress 
tumor formation in mice was examined as follows. 

Ten SCID mice were injected subcutaneously with HEK293 ceils that 
expressed P503S. Five mice received 150 micrograms of 20D4 intravenously at day 0 
0 (time of tumor cell injection), day 5 and day 9. Tumor size was measured for 50 days. 
Of the five animals that received no 20D4, three formed detectable tumors after about 2 
weeks which continued to enlarge tliroughout the study. In contrast, none of the five 
mice that received 20D4 formed tumors. These results demonstrate that the anti-P503S 
Mab 20D4 displays potent anti-tumor activity in vivo. 

EXAMPLE 24 

CHARACTERIZATION OF A T CELL RECEPTOR CLONE FROM A 
P501S-SPECIFIC T CELL CLONE 

T cells have a limited lifespan. However, cloning of T cell receptor 
(TCR) chains and subsequent transfer essentially enables infinite propagation of the T 
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cell specificity. Cloning of tumor-antigen TCR chains allows the transfer of the 
specificity into T cells isolated from patients that share the TCR MHC-restricting allele. 
Such T cells could then be expanded and used in adoptive transfer settings to introduce 
the tumor antigen specificity into patients carrying tumors that express the antigen. T 
5 cell receptor alpha and beta chains from a CDS T cell clone specific for the prostate- 
specific antigen P501S were isolated and sequenced as follows. 

Total mRNA from 2x10^ cells from CTL clone 4E5 (described above in 
Example 12) was isolated using Trizol reagent and cDNA was synthesized. To 
determine Va and Vb sequences in this clone, a panel of Va and Vb subtype-specific 

10 primers was synthesized and used in RT-PCR reactions with cDNA generated from 
each of the clones. The RT-PCR reactions demonstrated that each of the clones 
expressed a common Vb sequence that corresponded to the Vb7 subfamily. 
Futhermore, using cDNA generated from the clone, the Va sequence expressed was 
determined to be Va6. To clone the full TCR alpha and beta chains from clone 4E5, 

15 primers were designed that spanned the initiator and tenninator-coding TCR 
nucleotides. The primers were as follows: TCR Valpha-6 5'(sense): GGATCC — 
GCCGCCACC— ATGTCACTTTCTAGCCTGCT (SEQ ID NO: 756) BamHI site 
Kozak TCR alpha sequence TCR alpha 3' (antisensc): GTCGAC— 
TCAGCTGGACCACAGCCGCAG (SEQ ID NO: 757) Sail site TCR alpha constant 

20 sequence TCR Vbeta-7. S^sense): GGATCC—GCCGCCACC- 
ATGGGCTGCAGGCTGCTCT (SEQ ID NO: 758) BamHI site Kozak TCR alpha 
sequence TCR beta 3' (antisense): GTCGAC— TCAGAAATCCTTTCTCTTGAC (SEQ 
ID NO: 759) Sail site TCR beta constant sequence. Standard 35 cycle RT-PCR 
reactions were established using cDNA synthesized from the CTL clone and the above 

25 primers, employing the proofreading thermostable polymerase PWO (Roche, Nutley, 
NJ). 

The resultant specific bands (approx. 850 bp for alpha and approx. 950 
for beta) were ligated into the PCR blunt vector (Invitrogen) and ti'ansformed into E, 
coli, E .coli transformed with plasmids containing full-length alpha and beta chains 
30 were identified, and large scale preparations of the corresponding plasmids were 
generated. Plasmids containing full-length TCR alpha and beta chains were submitted 
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for sequencing. The sequencing reactions demonstrated the cloning of fulMength TCR 
alpha and beta chains with the determined cDNA sequences for the Vb and Va chains 
being showTi in SEQ ID NO: 760 and 761, respectively. The corresponding amino acid 
sequences are shown in SEQ ID NO: 762 and 763, respectively. The Va sequence Avas 
5 shown by nucleotide sequence alignment to be 99% identical (347/348) to Va6.2. and 
the Vb to be 99% identical to Vb7 (336/338). 

From the foregoing it will be appreciated that, although specific 
embodiments of the invention have been described herein for purposes of illustration, 
1 0 various modifications may be made without deviating from the spirit and scope of the 
invention. Accordingly, the invention is not limited except as by the appended claims. 
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CLAIMS 

What is Claimed; 

1. An isolated polynucleotide comprising a sequence selected from 
the group consisting of: 

(a) sequences provided in SEQ ID NO: l-Ill, 115-171, 173-175, 
177, 179-305, 307-315, 326, 328, 330, 332-335, 340-375, 381, 382 and 384-476, 524, 
526, 530, 531, 533, 535, 536, 552, 569-572, 587, 591, 593-606, 618-626, 630, 631, 634, 
636, 639-655, 674, 680, 681, 71 1, 713, 716, 720-722, 735, 737-739, 751, 753, 764, 765, 
773-776 and 786-788; 

(b) complements of the sequences provided in SEQ ID NO: 1-111, 
115-171, 173-175, 177, 179-305, 307-315, 326, 328, 330, 332-335, 340-375, 381, 382 
and 384-476, 524, 526, 530, 531, 533, 535, 536, 552, 569-572, 587, 591, 593-606, 618- 
626, 630, 631, 634, 636, 639-655, 674, 680, 681, 71 1, 713, 716, 720-722, 735, 737-739, 
751, 753, 764, 765, 773-776 and 786-788; 

(c) sequences consisting of at least 20 contiguous residues of a 
sequence provided in SEQ ID NO: 1-111, 115-171, 173-175, 177, 179-305, 307-315, 
326, 328, 330, 332-335, 340-375, 381, 382 and 384-476, 524, 526, 530, 531, 533, 535, 
536, 552, 569-572, 587, 591, 593-606, 618-626, 630, 631, 634, 636, 639-655, 674, 680, 
681, 71 1, 713, 716, 720-722, 735, 737-739, 751, 753, 764, 765, 773-776 and 786-788; 

(d) sequences that hybridize to a sequence provided in SEQ ID NO: 
1-1 II, 115-171, 173-175, 177, 179-305, 307-315, 326, 328, 330, 332-335, 340-375, 
381, 382 and 384-476, 524, 526, 530, 531, 533, 535, 536, 552, 569-572, 587, 591, 593- 
606, 618-626, 630, 631, 634, 636, 639-655, 674, 680, 681, 711, 713, 716, 720-722, 735, 
737-739, 751, 753, 764, 765, 773-776 and 786-788 under moderately stringent 
conditions; 

(e) sequences having at least 75% identity to a sequence of SEQ ID 
NO: I -111, 115-171, 173-175, 177, 179-305, 307-315, 326, 328, 330, 332-335, 340- 
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375. 3S1, 382 and 384-476. 524. 526. 530. 531. 533. 535. 536, 552, 569-572, 587, 591, 
593-606, 618-626, 630, 631, 634, 636, 639-655. 674. 680, 681, 711, 713, 716, 720-722, 
735. 737-739, 751. 753. 764. 765, 773-776 and 786-788; 

(f) sequences having at Iea.st 90% identit>' to a sequence of SEQ ID 
NO; 1-11 L 115-171, 173-175, 177, 179-305, 307-315, 326, 328, 330, 332-335, 340- 
375, 381, 382 and 384-476, 524, 526, 530, 531, 533, 535, 536, 552, 569-572, 587, 591, 
593-606, 618-626, 630, 631, 634, 636, 639-655, 674. 680, 681, 711, 713, 716, 720-722, 
735. 737-739, 751, 753, 764, 765, 773-776 and 786-788; and 

(g) degenerate variants of a sequence provided in SEQ ID NO: 1- 
111, 115-171. 173-175, 177, 179-305, 307-315, 326, 328, 330, 332-335. 340-375, 381, 
382 and 384-476, 524, 526, 530, 531, 533, 535, 536, 552. 569-572, 587, 591, 593-606, 
618-626, 630, 631, 634, 636, 639-655. 674. 680. 681. 711. 713, 716. 720-722, 735, 737- 
739, 751, 753, 764, 765, 773-776 and 786-788. 

2. An isolated polypeptide comprising an amino acid sequence 
selected from the group consisting of: 

(a) sequences recited in SEQ ID NO: 112-114. 172, 176, 178, 327, 
^^29, 331, 336, 339, 376-380, 383, 477-483, 496, 504, 505, 519, 520, 522, 525, 527, 
532, 534, 537-551, 553-568, 573-586, 588-590, 592, 627-629, 632, 633, 635, 637, 638, 
656-671, 675, 683, 684, 710, 712, 714, 715, 717-719, 723-734, 736, 740-750, 752, 754, 
755, 766-772, 777-785 and 789-791; 

(b) sequences having at least 70% identity to a sequence of SEQ ID 
NO: 112-114, 172, 176, 178,327, 329,331,336, 339,376-380,383,477-483,496,504, 
505, 519, 520. 522, 525, 527. 532, 534, 537-551, 553-568, 573-586, 588-590, 592, 627- 
629, 632, 633, 635, 637, 638, 656-671, 675, 683, 684, 710, 712, 714, 715, 717-719, 
723-734, 736, 740-750, 752, 754, 755, 766-772, 777-785 and 789-791; 

(c) sequences having at least 90% identity to a sequence of SEQ ID 
NO: 112-114, 172, 176, 178,327,329,331,336,339,376-380,383,477-483,496,504, 
505, 519, 520, 522, 525, 527, 532, 534, 537-551, 553-568, 573-586, 588-590, 592, 627- 
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629, 632, 633, 635, 637, 638, 656-671, 675, 683, 684, 710, 712, 714, 715, 717-719, 
723-734, 736, 740-750, 752, 754, 755, 766-772, 777-785 and 789-791 ; 

(d) sequences encoded by a polynucleotide of claim 1 ; 

(e) sequences having at least 70% identity to a sequence encoded by 
a polynucleotide of claim 1 ; and 

(f) sequences having at least 90% identity to a sequence encoded by 
a polynucleotide of claim 1 . 

3. An expression vector comprising a polynucleotide of claim 1 
operably linked to an expression control sequence. 

4- A host cell transformed or transfected with an expression vector 
according to claim 3. 

5. An isolated antibody, or antigen-binding fragment thereof, that 
specifically binds to a polypeptide of claim 2. 

6. A metliod for detecting the presence of a cancer in a patient, 
comprising the steps of: 

(a) obtaining a biological sample from the patient; 

(b) contacting the biological sample with a binding agent that binds 
to a polypeptide of claim 2; 

(c) detecting in the sample an amount of polypeptide that binds to 
the binding agent; and 

(d) comparing the amount of polypeptide to a predetermined cut-off 
value and therefrom determining the presence of a cancer in the patient. 

7. A fusion protein comprising at least one polypeptide according to 

claim 2. 
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8. The fusion protein of claim 7. wherein the fusion protein 
comprises a sequence selected from the group consisting of: 

(a) sequences provided in SEQ ID NO: 682, 692, 695. 699. 703 and 

709; and 

(b) sequences encoded by SEQ ID NO: 679, 691, 696, 700. 704 and 

708. 

9. An oligonucleotide that hybridizes to a sequence recited in SEQ 
ID NO: Mil, 115-171, 173-175, 177, 179-305, 307-315, 326, 328, 330, 332-335, 340- 
375, 38L 382 and 384-476, 524, 526, 530, 53L 533, 535, 536, 552, 569-572. 587. 591, 
593-606, 618-626, 630, 631, 634, 636, 639-655, 674, 680, 681, 711, 713, 716, 720-722, 
735, 737-739, 751, 753, 764, 765, 773-776 or 786-788 under moderately stringent 
conditions. 

1 0. A method for stimulating and/or expanding T cells specific for a 
tumor protein, comprising contacting T cells with at least one component selected from 
the group consisting of: 

(a) polypeptides according to claim 2; 

(b) polynucleotides according to claim 1 ; and 

(c) antigen-presenting cells that express a polypeptide according to 

claim 1, 

under conditions and for a time sufficient to permit the stimulation 
and/or expansion of T cells. 

11. An isolated T cell population, comprising T cells prepared 
according to the method of claim 10. 
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12. A composition comprising a first component selected from the 
group consisting of physiologically acceptable carriers and irmiiunostimulants, and a 
second component selected from the group consisting of: 

(a) polypeptides according to claim 2; 

(b) polynucleotides according to claim 1 ; 

(c) antibodies according to claim 5; 

(d) fusion proteins according to claim 7; 

(e) T cell populations according to claim 1 1 ; and 

(f) antigen presenting cells that express a polypeptide according to 

claim 2. 



13. A method for stimulating an immune response in a patient, 
comprising administering to the patient a composition of claim 12. 

14. A method for the treatment of a cancer in a patient, comprising 
administering to the patient a composition of claim 12. 

15. A method for determining the presence of a cancer in a patient, 
comprising the steps of: 

(a) obtaining a biological sample from the patient; 

(b) contacting the biological sample with an oligonucleotide 
according to claim 9; 

(c) detecting in the sample an amount of a polynucleotide that 
hybridizes to the oligonucleotide; and 

(d) compare the amount of polynucleotide that hybridizes to the 
oligonucleotide to a predetermined cut-off value, and therefrom determining the 
presence of the cancer in the patient. 



16. A diagnostic kit comprising at least one oligonucleotide 
according to claim 9. 
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17. A diagnostic kit comprising at least one antibody according to 
claim 5 and a detection reagent, wherein the detection reagent comprises a reporter 
group. 

18. A method for inhibiting the development of a cancer in a patient, 
comprising the steps of: 

(a) incubating CD4+ and/or CD8"^ T cells isolated from a patient 
witli at least i>ne component selected from the group consisting of: (i) polypeptides 
according to claim 2: (ii) polynucleotides according to claim 1; and (iii) antigen 
presenting cells that express a polypeptide of claim 2, such that T cell proliferate; and 

(b) administering to the patient an effective amount of the 
proliferated T cells, 

ihereby inhibiting the development of a cancer in the patient. 
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SEQUENCE LISTING 



<110> Corixa Corporation 

Smithkline Beechan Biologicais S.A. 

Xu, Jiangchun 

Dillon, Davin C. 

Mi t Cham, Jennifer L . 

HarlocJcer, Susan L. 

Jiang, Yuqui 

Reed, Steven G. 

Kalos, Michael D. 

Fanger, Gary R. 

Retter, Marc W. 

Stolk, John A. 

Day, Craig H. 

Skeiky, Yasir A.W. 

Wang, Aijun 

Meagher, Medeleine Joy 

Vanderbrugge, Didier 

Dewerchin, Marianne 

Dehottay^. Ph. 

de Rop, Philippe 



<120> COMPOSITIONS AND METHODS FOR THE THERAPY AND 
DIAGNOSIS OF PROSTATE CANCER 



<130> 210121. 42722PC 



<140> PCT 

<141> 2001-01-16 



<160> 792 



<170> FastSEQ for Windows Version 3.0 



<210> 1 

<211> 814 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc__f eature 
<222> (1) . . . (814) 
<223> n - A, T,C or G 



<400 
tttttttttt 
atcaaatctg 
ccagggggtc 
ctccttggct 
tttcagctcc 
cttcatggac 
ctagagcggc 
gcgcgcttgg 
attccacaca 
anctaactca 
tgccagctgc 



> 1 
tttttcacag 
agggttgtct 
cagtccctct 
cacagcctt c 
atccttgctg 
agtgtccagc 
cgccaccgcg 
cgtaatcatg 
acatacgagc 
cattaattgc 
attaatgaat 



tataacagct 
ggaggacttc 
ccttacttca 
tctaggcttc 
tgagtgtctg 
acatgtcact 
gtggagctcc 
gtcataactg 
cggaagcata 
gttgcgctca 
cggccaacgc 



ctttatttct 
aatacacctc 
tccccatccc 
ccagtgcctc 
gtgcgttgtg 
ctccactctc 
agcttttgtt 
tttcctgtgt 
aagtgtaaag 
ctgnccgctt 
ncggggaaaa 



gtgagttcta 
cccccatagt 
atgccaaagg 
caggacagag 
cctccagctt 
tcagtgtgga 
ccctttagtg 
gaaattgtta 
cctggggtgc 
tccagtcngg 
gcggtttgcg 



ctaggaaatc 
gaatcagctt 
aagaccctcc 
tgggttatgt 
ctgctcagtg 
tccactagtt 
agggttaatt 
tccgctcaca 
ctaatgagtg 
aaaactgtcg 
ttttgggggc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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tcTT^ccgctt ctcgctcact nantcctqcg ctcggtcntr cggctgcggg gaacggtatc 720 
actccrcaaa ggnggtatta cggttatccn naaatcnggg gatacccngg aaaaaant't 780 
":aaaggg cancaaaggg cngaaacgta aaaa 8x4 



ci a 



<210> 2 

<211> 816 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (816) 
<223> n - A, T, C or G 



<400> 
acagaaatgt 
ttcatggctg 
ct aciagt ctg 
aaqiA tgcag 
acagatgcct 
aaggaacggg 
ctqctgttaa 
gccgccaccg 
ggcgtaatca 
aaca tacgag 
catt aattgc 
ttantgaatc 
tcgctcattg 
ggtntnccgg 



tggatggtgg 
ttgqagcaat 
atgaacttcc 
atgt at ttgc 
gtgtgactcc 
gctcgtttat 
acaccccagc 
cggtggagct 
tggtcatagc 
ccggaacata 
gttgcgctca 
ngccaccccc 
atcctngcnc 
ttatccccaa 



agcacctttc 
agaaccccag 
caatcagatg 
aaagaagacg 
ggttctgact 
caccagtgag 
catcccttct 
ccagcttttg 
tgtttcctgt 
aagtgttaag 
ctgcccgctt 
cgggaaaagg 
ccggtcttcg 
acnggggata 



tatacgactt 
ttctacgagc 
agcatggatg 
aaggcagagt 
tttgaggagg 
gagcaggacg 
ttcaaaaggg 
ttccctttag 
gtgaaattgt 
cctggggtgc 
tccagtcggg 
cggttgcntt 
gctgcggnga 
cccnga 



acaggacagc 
tgctgatcaa 
at tggccaga 
ggtgtcaaat 
ttgttcatca 
t gagcccccg 
at ccactagt 
tgagggttaa 
tatccgctca 
ctaatgantg 
aaaactgtcg 
tt gggcctct 
acggttcact 



agatggggaa 
aggacttgca 
aatgaagaag 
ctttgacggc 
tgatcacaac 
ccctgcacct 
tctagaagcg 
ttgcgcgctt 
caatt Gcccc 
agctaactcn 
t gccact gen 
tccgctttcc 
cct caaaggc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
816 



<210> 3 

<211> 773 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_feature 
<222> (1) . . . (773) 
<223> n - A,T,C or G 



<400> 
cttttgaaag 
; cel. get cct 
tcct caaaag 
" c L u cct teg 
tccatgctca 
t cgtagaact 
gtcgtataga 
ccaattcgcc 
gtgactggga 
ccagctgggc 
gaatgggnaa 
■f::^\^.ccaent 
ot- 1 ccct tec 



aagggatggc 
cactggtgat 
t cagaaccgg 
tcttctttgc 
tctgattggg 
ggggttctat 
aaggtgctcc 
ctatantgag 
aaaccctggg 
gtaatancga 
atgggacccc 
nnaccgett a 
t t tcncnccn 



tggggtgttt 
aaacgagccc 
agtcacacag 
aaatacatct 
aagttcatca 
tgctccaaca 
accatccaac 
tcgtattacg 
cgttaccaac 
aaaggcccgc 
cctgttaccg 
cactttgcca 
ctttcccccg 



aacagcagag 
cgttccttgt 
gcatctgtgc 
gcaaacttct 
gactttagtc 
gccatgaatt 
atgttctgtc 
cgcgctcact 
ttaatcgcct 
accgatcgcc 
cgcattnaac 
gcgccttanc 
gggt 1 1 cccc 



gtgcagggcg 
tgtgat catg 
cgtcaaagat 
tcttcatttc 
canntccttt 
ccccatctgc 
ctcgaggggg 
ggcegtcgtt 
tgcagcacat 
cttccaaeag 
ccccgcnggg 
gcccgct ccc 
cntcaaaccc 



ggggctcacg 
atgaacaacc 
ttgacaccac 
tggccaatca 
gatcagcagc 
tgtcctgtaa 
ggcccggtac 
ttacaacgtc 
ccccct ttcg 
ttgcgcacct 
tttngttgtt 
tttcnccttt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
773 



<210> 4 
<211> 828 
<212> DNA 
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<213> Homo sapien 
<220> 

<221> misc_f eature 
<222> {1} . . . (828) 
<223> n - A,T,C or G 

<400> 4 

cctcctgagt cctactgacc tgtgctttct ggtgtggagt ccagggctgc taggaaaagg 60 

aatgggcaga cacaggtgta tgccaatgtt tctgaaatgg gtataatttc gtcctctcct 120 

tcggaacact ggctgtctct gaagacttct cgctcagttt cagtgaggac acacacaaag 180 

acgtgggtga ccatgttgtt tgtggggtgc agagatggga ggggtggggc ccaccctgga 24 0 

agagtggaca gtgacacaag gtggacactc tctacagatc actgaggata agctggagcc 300 

acaatgcatg aggcacacac acagcaagga tgacnctgta aacatagccc acgctgtcct 360 

gngggcacig ggaagcctan atnaggccgt gagcanaaag aaggggagga tccactagtt 420 

ctanagcgqc cgccaccgcg gtgganctcc ancttttgtt ccctttagtg agggttaatt 4 80 

gcgcgcttgg cntaa^catg gtcatanctn tttcctgtgt gaaattgtta tccgctcaca 540 

attccacaca acatacganc cggaaacata aantgtaaac ctggggtgcc taatgantga 600 

ctaactcaca ttaattgcgt tgcgctcact gcccgctttc caatcnggaa acctgtcttg 660 

ccncttgcat tnatqaatcn gccaaccccc ggggaaaagc gtttgcgttt tgggcgctct 720 
tccgcttccL cnctcantta ntccctncnc tcggtcattc cggctgcngc aaaccggttc 
accncctcca aagggqgtat tccggtttcc ccnaatccgg gganancc 



<210> b 

<211> 834 

<212> DNA 

<213> liomo sapien 

<220> 

<221> misc_f eature 
<222> {!)... (834) 
<223> n - A,T,C or G 



<210> 6 

<211> 818 

<212> DNA 

<213> Homo sapien 

<220> 

<221> mis cofeature 
<222> (1) . . . (818) 
<223> n = A, T,C or G 



780 
828 



<400> S 

tttttttttt tttttactga tagatggaat ttattaagct tttcacatgt gatagcacat 60 
agttttaatt gcatccaaag tactaacaaa aactctagca atcaagaatg gcagcatgtt 120 
attttataac aatcaacacc tgtggctttt aaaatttggt tttcataaga taatttatac 180 
tgaagtaaat ctagccatgc ttttaaaaaa tgctttaggt cactccaagc ttggcagtta 240 
acatttggca taaacaataa taaaacaatc acaatttaat aaataacaaa tacaacattg 
taggccataa tcatatacag tataaggaaa aggtggtagt gttgagtaag cagttattag 
aatagaatac cttggcctct atgcaaatat gtctagacac tttgattcac tcagccctga 
cattcagttt tcaaagtagg agacaggttc tacagtatca ttttacagtt tccaacacat 
tgaaaacaag tagaaaatga tgagttgatt tttattaatg cattacatcc tcaagagtta 
tcaccaaccc ctcagttata aaaaattttc aagttatatt agtcatataa cttggtgtgc 
ttattttaaa ttagtgctaa atggattaag tgaagacaac aatggtcccc taatgtgatt bbu 
gatattggtc atttttacca gcttctaaat ctnaactttc aggcttttga actggaacat 720 
tgnatnacag tgttccanag ttncaaccta ctggaacatt acagtgtgct tgattcaaaa 780 
tgttattttg ttaaaaatta aattttaacc tggtggaaaa ataatttgaa atna 834 



300 
360 
420 
480 
540 
600 
660 
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<400> 6 

•.T'tirtrrt tttttttttt aagacccrca rcaatagatg gagacataca gaaatagtca 60 

.V ■■^acatct acaaaatgcc agtatcaggc ggcggcttcg aagccaaagt gatgtttgga 120 

.gtaaagrga aatattagtt ggcggargaa gcagatagtg aggaaagttg agccaataat 180 

jacgtgaagt ccgtggaagc ctgtggctac aaaaaatgtt gagccgtaga tgccgtcgga 240 

aatggrgaag ggagactcga agtactctga ggcttgtagg agggtaaaat agagacccag 300 

vaciaattgta ataagcagtg cttgaattat ttggtttcgg trgttttcta ttagactatg 360 

c;toagctcag gtgattgata ctcctgatgc gagtaatacg gatgtgttta ggagtgggac 420 

ttctagggga tttagcgggg tgatgcctgr tgggggccag tgccctccta gttggggggt 480 

aq^^qqctagg ctggagtggt aaaaggctca gaaaaatcct gcgaagaaaa aaacttctga 540 

ggtaataaat aggattatcc cgtatcgaag gcctttttgg acaggtggtg tgtggtggcc 600 

ttqgtatgtg ctttctcgtg ttacatcgcg ccatcattgg tatatggtta gtgtgttggg 660 

M ,antanggc ctantatgaa gaacttttgg antggaatta aatcaatngc ttggccggaa 720 

gtcattanga nggctnaaaa ggccctgtta ngggtctggg ctnggtttta cccnacccat 780 

ggaatncncc ccccggacna ntgnatccct attcttaa 818 



<210> 7 

<211> 817 

<212> DNA 

<213> Homo sapien 

<220> 

<221> mi sc_ feat ure 
<222> (1) . /. (817) 
<223> n = A, T, C or G 



<400> 
tttttttttt 
cgggccctat 
ggtttgctcc 
jagtggtttg 
I o-atgagtg 
■7L act act eg 
gaagtatgta 
attggtggcc 
aggatncctt 
L caaacngtc 
gaatnttnng 
cnt tat cntn 
acnatt ggat 
cttnantgan 



7 

tttttttttt 
ttcaaagatt 
acagatttca 
gtttagacgt 
caagacgtct 
attgtcaacg 
ggaattgaag 
aattgatttg 
ngggatggga 
tctanttcct 
gaaaagggct 
aaaggtnata 
nccccanttc 
ggttattcnc 



tggctctaga 
tttaggggaa 
gagcattgac 
ccgggaattg 
tgtgatgtaa 
tcaaggagtc 
attaatccgc 
atggtaaggg 
aggcnatnaa 
gaaacgtct g 
t acaggacta 
accnctccta 
canaaanggc 
ccctngcntt 



gggggtagag 
ttaattctag 
cgtagtatac 
catctgtttt 
ttattatacn 
gcaggtcgcc 
cgtagtcggt 
gagggatcgt 
ggactangga 
aaatgttaat 
gaaaccaaat 
tnatcccacc 
cnccccccgg 
atcancc 



ggggtgctat 
gacgatgggt 
ccccggtcgt 
taagcctaat 
aatgggggct 
tggttctagg 
gttctcctag 
tgaactcgtc 
tnaatggcgg 
aanaattaan 
angaaaanta 
caatngnatt 
tgnannccnc 



agggtaaata 
atgaaactgt 
gtagcggtga 
gtggggacag 
tcaatcggga 
aataatgggg 
gttcaatacc 
tgttatgtaa 
gcangatatt 
tttngttatt 
atnntaangg 
ccccacncnn 
cttttgttcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
817 



<210> 8 

<211> 799 

<212> DNA 

<213> Homo sapien 



O20> 



'!> misc feature 



<222> (1) . . . (799) 
<223> n = A, T, C or G 



<400> 8 

catrtccggg tttactttct aaggaaagcc 

cataagqaga actttctgct ggcacgcgct 

ctgaagcgca cgtcccagaa ggtggacttg 

tacgaacagc gcctgaaagt gctggagcgg 



gagcggaagc tgctaacgtg ggaatcggtg 60 

agggacaagc gggagagcga ctccgagcgt 120 

gcactgaaac agctgggaca catccgcgag 180 

gaggtccagc agtgtagccg cgtcctgggg 240 
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tgggtggccg angcctganc cgctctgcct tgctgccccc angtgggccg ccaccccctg 300 

acctgcctgg gtccaaacac tgagccctgc tggcggactt caagganaac ccccacangg 360 

ggattttgct cctanantaa ggctcatctg ggcctcggcc cccccacctg gttggccttg 420 

tctttgangt gagccccatg tccatctggg ccactgtcng gaccaccttt ngggagtgtt 480 

ctccttacaa ccacannatg cccggctcct cccggaaacc antcccancc tgngaaggat 54,0 

caagncctgn atccactnnt nctanaaccg gccnccnccg cngtggaacc cnccttntgt 600 

tccttttcnt tnagggttaa tnncgccttg gccttnccan ngtcctncnc nttttccnnt 660 

gttnaaattg ttangcnccc nccnntcccn cnncnncnan cccgacccnn annttnnann 720 

ncctgggggt nccnncngat tgacccnncc nccctntant tgcnttnggg nncnntgccc 780 
ctttccctct nggganncg 

<210> 9 

<211> 801 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (801) 
<223> n = A,T,C or G 

<400> 9 

acgccttgat cctcccaggc tgggactggt tctgggagga gccgggcatg ctgtggtttg 60 

taangatgac actcccaaag gtggtcctga cagtggccca gatggacatg gggctcacct 120 

caaggacaag gccaccaggt gcgggggccg aagcccacat gatccttact ctatgagcaa 180 

aatcccctgt gggggcttct ccttgaagtc cgccancagg gctcagtctt tggacccang 24 0 

caggtcatgg ggttgtngnc caactggggg ccncaacgca aaanggcnca gggcctcngn 300 

cacccatccc angacgcggc tacactnctg gacctcccnc tccaccactt tcatgcgctg 360 

ttcntacccg cgnatntgtc ccanctgttt cngtgccnac tccancttct nggacgtgcg 420 

ctacatacgc ccggantcnc nctcccgctt tgtccctatc cacgtnccan caacaaattt 480 

cnccntantg caccnatt cc cacntttnnc agntttccnc nncgngcttc cttntaaaag 54 0 

ggttganccc cggaaaatnc cccaaagggg gggggccngg tacccaactn ccccctnata 600 

gctgaantcc ccatnaccnn gnct<r:natgg anccntccnt tttaannacn ttctnaactt ""'^ 

gggaanancc ctcgnccntn cccccnttaa tcccnccttg cnangnncnt cccccnntcc 720 

ncccnnntng gcntntnann cnaaaaaggc ccnnnancaa tctcctnncn cctcanttcq 780 
ccanccctcg aaatcggccn c 



801 

<210> 10 

<211> 789 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1)...{789) 
<223> n - A, T,C or G 

<400> 10 

cagtctatnt ggccagtgtg gcagctttcc ctgtggctgc cggtgccaca tgcctgtccc 60 

acagtgtggc cgtggtgaca gcttcagccg ccctcaccgg gttcaccttc tcagccctgc 120 

agatcctgcc ctacacactg gcctccctct accaccggga gaagcaggtg ttcctgccca 180 

aataccgagg ggacactgga ggtgctagca gtgaggacag cctgatgacc agcttcctgc 240 

caggccctaa gcctggagct cccttcccta atggacacgt gggtgctgga ggcagtggcc 300 

tgctcccacc tccacccgcg ctctgcgggg cctctgcctg tgatgtctcc gtacgtgtgg 360 

tggtgggtga gcccaccgan gccagggtgg ttccgggccg gggcatctgc ctggacctcg 420 

ccatcctgga tagtgcttcc tgctgtccca ngtggcccca tccctgttta tgggctccat 4 80 

tgtccagctc agccagtctg tcactgccta tatggtgtct gccgcaggcc tgggtctggt 540 

cccatttact ttgctacaca ggtantattt gacaagaacg anttggccaa atactcagcg 600 
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L t.aduaaatt ccagcaacat tggggqtgga aggcctgcci: cactqggtcc aactccccgc 660 
tcctgttaac cccatggggc tgccggcttg gccgccaatt tctgrtgcrg ccaaantnat 720 
ataac^ctct gctqccacct gttgctggc:: gaagrgcnta cngcncanct nggggggrng 780 



<210> 11 

<211> 772 

<212> DNA 

<213> Homo sapien 

<220> 

<221> iTLisc^f eature 
<222> {!)... (772) 
<223> n = A, T, C or G 



<400> 11 

cccaccctac ccaaatatta qacaccaaca cagaaaagct agcaatggat tcccttctac 60 

t.ttgttaaat aaataagtta aatatttaaa tgcctgtgtc tctgtgatgg caacagaagg 120 

arcaacaggc cacatcctga taaaaggtaa gaggggggtg gatcagcaaa aagacagtgc 180 

tgtgggctga ggggacctgg ttcttgtgtg ttgcccctca ggactcttcc cctacaaata 240 

actttcatat gttcaaatcc catggaggag tgtttcatcc tagaaactcc catgcaagag 300 

ctacattaaa cgaagctgca ggttaagggg cttanagatg ggaaaccagg tgactgagtt 360 

t:attcagctc ccaaaaaccc ttctctaggt gtgtctcaac taggaggcta gctgttaacc 420 

L:tqagcctgg gtaatccacc tgcagagtcc ccgcattcca gtgcatggaa cccttctggc 480 

■r:tccctgtat aagtccagac tgaaaccccc ttggaaggnc tccagtcagg cagccctana 540 

aactggggaa aaaagaaaag gacgccccan cccccagctg tgcanctacg cacctcaaca 600 

gcacagggtg gcagcaaaaa aaccacttta ctttggcaca aacaaaaact ngggggggca 660 

accccggcac cccnangggg gttaacagga ancngggnaa cntggaaccc aattnaggca 720 

ggcccnccac cccnaatntt gctgggaaat ttttcctccc ctaaattntt tc 772 



<210> 12 

<211> 751 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (751) 
<223> n = A, T, C or G 



<400> 12 

gccccaattc cagctgccac accacccacg gtgactgcat tagttcggat gtcatacaaa 60 

agctgattga agcaaccctc tactttttgg tcgtgagcct tttgcttggt gcaggtttca 120 

ttggctgtgt tggtgacgtt gtcattgcaa cagaatgggg gaaaggcact gttctctttg 180 

aagtanggtg agtcctcaaa atccgtatag ttggtgaagc cacagcactt gagccctttc 240 

atggtggtgt tccacacttg agtgaagtct tcctgggaac cataatcttt cttgatggca 300 

ggcactacca gcaacgtcag ggaagtgctc agccattgtg gtgtacacca aggcgaccac 360 

agcagctgcn acctcagcaa tgaagatgan gaggangatg aagaagaacg tcncgagggc 420 

acacttgctc tcagtcttan caccatanca gcccntgaaa accaananca aagaccacna 480 

cnccggctgc gatgaagaaa tnaccccncg ttgacaaact tgcatggcac tggganccac 540 

agtggcccna aaaatcttca aaaaggatgc cccatcnatt gaccccccaa atgcccactg 600 

ccaacagggg ctgccccacn cncnnaacga tganccnatt gnacaagatc tncntggtct 660 

tnaacnt gaaccctgcn tngtggctcc tgttcaggnc cnnggcctga cttctnaann 720 

^;ar:gaactcn gaagncccca cngganannc g '^51 



<210> 13 
<211> 729 
<212> DNA 
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<213> Homo saplen 
<220> 

<221> misc_f eature 
<222> (1) . . . (729) 
<223> n = A,T,C or G 

<400> 13 

gagccaggcg tccctctgcc tgcccactca gtggcaacac ccgggagctg ttttgtcctt 60 

tgtggancct cagcagtncc ctctttcaga actcantgcc aaganccctg aacaggagcc 120 

accatgcagt gcttcagctt cattaagacc atgatgatcc tcttcaattt gctcatcttt 180 

ctgtgtggtg cagccctgtt ggcagtgggc atctgggtgt caatcgatgg ggcatccttt 240 

ctgaagatct tcgggccact gtcgtccagt gccatgcagt ttgtcaacgt gggctacttc 300 

ctcatcgcag ccggcgttgt ggtcttagct ctaggtttcc tgggctgcta tggtgctaag 360 

actgagagca agtgtgccct cgtgacgttc ttcttcatcc tcctcctcat cttcattgct 420 

gaggttgcaa tgctgtggtc gccttggtgt acaccacaat ggctgagcac ttcctgacgt 480 

tgctggtaat gcctgccatc aanaaaagat tatgggttcc caggaanact tcactcaagt 540 

gttggaacac caccatgaaa gggctcaagt gctgtggctt cnnccaacta tacggatttt 600 

gaagantcac ctacttcaaa gaaaanagtg cctttccccc atttctgttg caattgacaa 660 

acgtccccaa cacagccaat tgaaaacctg cacccaaccc aaangggtcc ccaaccanaa 720 

attnaaggg -^29 

<210> 14 

<211> 816 

<212> DNA 

<213> Hoiuo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (816) 
<223> n = A, T,C or G 

<400> 14 

tgctcttcct caaagttgtt cttgttgcca taacaaccac cataggtaaa gcgggcgcag 60 

tgttcgctga aggggttgta gtaccagcgc gggatgctct ccttgcagag tcctgtgtct 120 

ggcaggtcca cgcagtgccc tttgtcactg gggaaatgga tgcgctggag ctcgtcaaag 180 

ccactcgtgt atttttcaca ggcagcctcg tccgacgcgt cggggcagtt gggggtgtct 24 0 

tcacactcca ggaaactgtc natgcagcag ccattgctgc agcggaactg ggtgggctga 300 

cangtgccag agcacactgg atggcgcctt tccatgnnan gggccctgng ggaaagtccc 360 

tganccccan anctgcctct caaangcccc accttgcaca ccccgacagg ctagaatgga 420 

atcttcttcc cgaaaggtag ttnttcttgt tgcccaancc anccccntaa acaaactctt 480 

gcanatctgc tccgnggggg tcntantacc ancgtgggaa aagaacccca ggcngcgaac 54 0 

caancttgtt tggatncgaa gcnataatct nctnttctgc ttggtggaca gcaccantna 600 

ctgtnnanct ttagnccntg gtcctcntgg gttgnncttg aacctaatcn ccnntcaact 660 

gggacaaggt aantngccnt cctttnaatt cccnancntn ccccctggtt tggggttttn 720 

cncnctccta ccccagaaan nccgtgttcc cccccaacta ggggccnaaa ccnnttnttc 780 

cacaaccctn ccccacccac gggttcngnt ggttng 816 

<210> 15 

<211> 783 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (783) 
<223> n = A,T,C or G 
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<400> 15 

ccaaggcctg ggcaggcata nacttgaagg tacaacccca ggaacccctg gtgctgaagg 60 

H^grggaaaa cacagattgg cgcctactgc ggggtgacac ggatgtcagg gtagagagga 120 

aaqacccaaa ccaggtggaa ctgtggggac tcaaggaang cacctacctg ttccagctga 180 

cagtgactag ctcagaccac ccagaggaca cggccaacgt cacagtcact gtgctgtcca 240 

ccaagcagac agaagactac tgcctcgcat ccaacaangt gggtcgctgc cggggctctt 300 

tcccacgctg gtactatgac cccacggagc agatctgcaa gagtttcgtt tatggaggct 360 

gcttgggcaa caagaacaac taccttcggg aagaagagtg cattctancc tgtcngggtg 420 

tgcaaggtgg gcctttgana ngcanctctg gggctcangc gactttcccc cagggcccct 480 

ccatggaaag gcgccatcca ntgttctctg gcacctgtca gcccacccag ttccgctgca 540 

ru^aatggctg ctgcatcnac antttcctng aattgtgaca acacccccca ntgcccccaa 600 

ccctcccaac aaagcttccc tgttnaaaaa tacnccantt ggcttttnac aaacncccgg 660 

cncctccntt ttccccnntn aacaaagggc nctngcnttt gaactgcccn aacccnggaa 720 

■i.ci:nccnnag aaaaantncc ccccctgatt cctnnaancc cctccncnaa anctnccccc 780 

occ ^ 783 



<210> 16 

<211> 801 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (801) 
<223> n = A,T,C or G 



<400> 16 

gccccaattc cagctgccac accacccacg gtgactgcat tagttcggat gtcatacaaa 60 

agctgattga agcaaccctc tactttttgg tcgtgagcct tttgcttggt gcaggtttca 120 

ttggctgtgt tggtgacgtt gtcattgcaa cagaatgggg gaaaggcact gttctctttg 180 

aagtagggtg agtcctcaaa atccgtatag ttggtgaagc cacagcactt gagccctttc 240 

atggtggtgt tccacacttg agtgaagtct tcctgggaac cataatcttt cttgatggca 300 

ggoactacca gcaacgtcag gaagtgctca gccattgtgg tgtacaccaa ggcgaccaca 360 

'K;agctgcaa cctcagcaat gaagatgagg aggaggatga agaagaacgt cncgagggca 420 

cacttgctct ccgtcttagc accatagcag cccangaaac caagagcaaa gaccacaacg 480 

ccngctgcga atgaaagaaa ntacccacgt tgacaaactg catggccact ggacgacagt 540 

tggcccgaan atcttcagaa aagggatgcc ccatcgattg aacacccana tgcccactgc 600 

cnacagggct gcnccncncn gaaagaatga gccattgaag aaggatcntc ntggtcttaa 660 

I gaactgaaa ccntgcatgg tggcccctgt tcagggctct tggcagtgaa ttctganaaa 720 

aaggaacngc ntnagccccc ccaaangana aaacaccccc gggtgttgcc ctgaattggc 780 

ggccaaggan ccctgccccn g 801 



<210> 17 

<211> 740 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (740) 
<223> n = A,T,C or G 



<400> 17 

gtuagagcca ggcgtccctc tgcctgccca ctcagtggca acacccggga gctgttttgt 60 

cctttgtgga gcctcagcag ttccctcttt cagaactcac tgccaagagc cctgaacagg 120 

agccaccatg cagtgcttca gcttcattaa gaccatgatg atcctcttca atttgctcat 180 

rtttctgtgt ggtgcagccc tgttggcagt gggcatctgg gtgtcaatcg atggggcatc 240 

ctttctgaag atcttcgggc cactgtcgtc cagtgccatg cagtttgtca acgtgggcta 300 
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cttcctcatc gcagccggcg ttgtggtctt tgctcttggt ttcctgggct gctatggtgc 360 

taagacggag agcaagtgtg ccctcgtgac gttcttcttc atcctcctcc tcatcttcat 420 

tgctgaagtt gcagctgctg tggtcgcctt ggtgtacacc acaatggctg aaccattcct 480 

gacgttgctg gtantgcctg ccatcaanaa agattatggg ttcccaggaa aaattcactc 540 

aantntggaa caccnccatg aaaagggctc caatttctgn tggcttcccc aactataccg 600 

gaattttgaa agantcnccc tacttccaaa aaaaaanant tgcctttncc cccnttctgt 660 

tgcaatgaaa acntcccaan acngccaatn aaaacctgcc cnnncaaaaa ggntcncaaa 720 

caaaaaaant nnaagggttn 74 0 



<210> 18 

<211> 802 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (802) 
<223> n = A/T,C or G 



<400> 18 

ccgctggttg cgctggtcca gngnagccac gaagcacgtc agcatacaca gcctcaatca 60 

caaggtcttc cagctgccgc acattacgca gggcaagagc ctccagcaac actgcatatg 120 

ggatacactt tactttagca gccagggtga caactgagag gtgtcgaagc ttattcttct 180 

gagcctctgt tagtggagga agattccggg cttcagctaa gtagtcagcg tatgtcccat 24 0 

aagcaaacac tgtgagcagc cggaaggtag aggcaaagtc actctcagcc agctctctaa 300 

cattgggcat gtccagcagt tctccaaaca cgtagacacc agnggcctcc agcacctgat 360 

ggatgagtgt ggccagcgct gcccccttgg ccgacttggc taggagcaga aattgctcct 420 

ggttctgccc tgtcaccttc acttccgcac tcatcactgc actgagtgtg ggggacttgg 480 

gctcaggatg tccagagacg tggttccgcc ccctcnctta atgacaccgn ccanncaacc 540 

gtcggctccc gccgantgng ttcgtcgtnc ctgggtcagg gtctgctggc cnctacttgc 600 

aancttcgtc nggcccatgg aattcaccnc accggaactn gtangatcca ctnnttctat 650 

aaccggncgc caccgcnnnt ggaactccac tcttnttncc tttacttgag ggttaaggtc 720 

acccttnncg ttaccttggt ccaaaccntn ccntgtgLcg anatngtnaa tcngg/iccxia 78 0 

tnccanccnc atangaagcc ng 8 02 



<210> 19 

<211> 731 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (731) 
<223> n = A,T,C or G 



<400> 19 

cnaagcttcc aggtnacggg ccgcnaancc tgacccnagg tancanaang cagncngcgg 60 

gagcccaccg tcacgnggng gngtctttat nggagggggc ggagccacat cnctggacnt 120 

cntgacccca actccccncc ncncantgca gtgatgagtg cagaactgaa ggtnacgtgg 180 

caggaaccaa gancaaannc tgctccnntc caagtcggcn nagggggcgg ggctggccac 24 0 

gcncatccnt cnagtgctgn aaagccccnn cctgtctact tgtttggaga acngcnnnga 300 

catgcccagn gttanataac nggcngagag tnantttgcc tctcccttcc ggctgcgcan 360 

cgngtntgct tagnggacat aacctgacta cttaactgaa cccnngaatc tnccncccct 420 

ccactaagct cagaacaaaa aacttcgaca ccactcantt gtcacctgnc tgctcaagta 4 80 

aagtgtaccc catncccaat gtntgctnga ngctctgncc tgcnttangt tcggtcctgg 540 

gaagacctat caattnaagc tatgtttctg actgcctctt gctccctgna acaancnacc 600 

cnncnntcca agggggggnc ggcccccaat ccccccaacc ntnaattnan tttanccccn 660 

cccccnggcc cggcctttta cnancntcnn nnacngggna aaaccnnngc tttncccaac 720 
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<210> 20 

<211> 754 

<212> DNA 

<213> Homo sapien 

<220> 

<221> inisc_f eature 
<222> (1) . . . (754 ) 
<223> n = A, C or G 



<400> 20 

Irttrttttt tttttttttt taaaaacccc ctccattnaa tgnaaacttc cgaaattgtc 60 

caaccccctc ntccaaatnn ccntttccgg gngggggttc caaacccaan ttanntttgg 120 

annttaaatt aaatnttnnt tggnggnnna anccnaatgt nangaaagtt naacccanta 180 

tnancttnaa tncctggaaa ccngtngntt ccaaaaatnt ttaaccctta antccctccg 240 

aaatngttna nggaaaaccc aanttctcnt aaggttgttt gaaggntnaa tnaaaanccc 300 

nnccaattgt ttttngccac gcctgaatta attggnttcc gntgttttcc nttaaaanaa 360 

ggnnancccc ggttantnaa tccccccnnc cccaattata ccganttttt ttngaattgg 420 

gancccncgg gaattaacgg ggnnnntccc tnttgggggg cnggnncccc ccccntcggg 4 80 

ggttngggnc aggncnnaat tgtttaaggg tccgaaaaat ccctccnaga aaaaaanctc 540 

ccaggntgag nntngggttt nccccccccc canggcccct ctcgnanagt tggggtttgg 600 

ggqgcctggg attttntttc ccctnttncc tccccccccc ccnggganag aggttngngt 660 

tttgntcnnc ggccccnccn aaganctttn ccganttnan ttaaatccnt gcctnggcga 720 

agtccnttgn agggntaaan ggccccctnn cggg 754 



<210> 21 

<211> 755 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (755) 
<223> n = A,T,C or G 



<400> 21 

dtcancccat gaccccnaac nngggaccnc tcanccggnc nnncnaccnc cggccnatca 60 

nngtnagnnc actncnnttn natcacnccc cnccnactac gcccncnanc cnacgcncta 120 

nncanatncc actganngcg cgangtngan ngagaaanct nataccanag ncaccanacn 180 

ccagctgtcc nanaangcct nnnatacngg nnnatccaat ntgnancctc cnaagtattn 240 

nncnncanat gattttcctn anccgattac ccntnccccc tancccctcc cccccaacna 300 

cgaaggcnct ggnccnaagg nngcgncncc ccgctagntc cccnncaagt cncncnccta 360 

aactcanccn nattacncgc ttcntgagta tcactccccg aatctcaccc tactcaactc 420 

aaaaanatcn gatacaaaat aatncaagcc tgnttatnac actntgactg ggtctctatt 4 80 

tt.agnggtcc ntnaancntc ctaatacttc cagtctncct tcnccaattt ccnaanggct 54 0 

ctttcngaca gcatnttttg gttcccnntt gggttcttan ngaattgccc ttcntngaac 600 

gggctcntct tttccttcgg ttancctggn ttcnnccggc cagttattat ttcccntttt 660 

aaattcntnc cntttanttt tggcnttcna aacccccggc cttgaaaacg gccccctggt 720 

naitaqgttgt tttganaaaa tttttgtttt gttcc 755 



<210> 22 

<211> 849 

<212> DNA 

<213> Homo sapien 

<220> 



BNSDOCiO <--W: '■^^,'€:^^Ar i 
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<221> misc_feature 
<222> (1) . . . (849) 
<223> n - A,T,C or G 



<400> 22 

tttttttttt tttttangtg tngtcgtgca ggtagaggct 
acgctnggan taangcgacc cganttctag ganncnccct 
atcctgnnna cggaanggtc accggnngat nntgctaggg 
cataactcng nggccctgcc caccaccttc ggcggcccng 
gnnttaaccn cactnngcna ncggtttccn nccccnncng 
tctgtcttcc cctgnagncn anaaantggg ccncggnccc 
cngccntcta nccncngccc cccctccant nngggggact 
nnaccccnnn gggtncctcg gttgtcgant cnaccgnang 
tgcgttnttg gcccctaccc ttcgctncgg nncacccttc 
cncnncgnng cctcncctcg caacacccgc nctcntcngt 
nccctcncnc ngncgnancn ctccnccncc gtctcannca 
ntcanccacn ggnngacnng nagcncnntc gcnccgcgcn 
ctncntcngg ccantnncgc tcaanccnna cnaaacgccg 
ncctccncga gtcctcccgn cttccnaccc angnnttccn 
nncangcgg 



tactacaant 
aaaatcanac 
tgnccnctcc 
ngnccgggcc 
acccnggcga 
ctttacccct 
gccnanngct 
ccanggattc 
ccgacnanga 
ncggnnnccc 
ccaccccgcc 
gcgncnccct 
ctgcgcggcc 
cgaggacacn 



gtgaanacgt 
tgtgaagatn 
cannncnttn 
cgggtcattn 
tccggggtnc 
nnacaagcca 
ccgttnctng 
cnaaggaagg 
nccgctcccg 
ccccacccgc 
ccgccaggcc 
cgccncngaa 
cgnagcgncc 
nnaccccgcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
849 



<210> 23 

<211> 872 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_f eature 
<222> (1) . . . (872) 
<223> n = A, T,C or G 



<400> 23 

^ _ j^--^-..^ ^.^ yiicLou^-y uyo gc;cLogcLric tcttrucctc cgcaaccatg 60 

tctgacnanc ccgattnggc ngatatcnan aagntcganc agtccaaact gantaacaca 120 

cacacncnan aganaaatcc nctgccttcc anagtanacn attgaacnng agaaccangc 180 

nggcgaatcg taatnaggcg tgcgccgcca atntgtcncc gtttattntn ccagcntcnc 240 

ctnccnaccc tacntcttcn nagctgtcnn acccctngtn cgnacccccc naggtcggga 300 

tcgggtttnn nntgaccgng cnncccctcc ccccntccat nacganccnc ccgcaccacc 360 

nanngcncgc nccccgnnct cttcgccncc ctgtcctntn cccctgtngc ctggcncngn 420 

accgcattga ccctcgccnn ctncnngaaa ncgnanacgt ccgggttgnn annancgctg 480 

tgggnnngcg tctgcnccgc gttccttccn ncnncttcca ccatcttcnt tacngggtct 540 

ccncgccntc tcnnncacnc cctgggacgc tntcctntgc cccccttnac tccccccctt 600 

cgncgtgncc cgnccccacc ntcatttnca nacgntcttc acaannncct ggntnnctcc 660 

cnancngncn gtcanccnag ggaagggngg ggnnccnntg nttgacgttg nggngangtc 720 

cgaanantcc tcnccntcan cnctacccct cgggcgnnct ctcngttncc aacttancaa 780 

ntctcccccg ngngcncntc tcagcctcnc ccnccccnct ctctgcantg tnctctgctc 84 0 

tnaccnntac gantnttcgn cnccctcttt cc 872 

<210> 24 

<211> 815 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_f eature 
<222> (1) . . . (815) 
<223> n = A, T,C or G 
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<400> 24 

gcatgcaaqc ttgagtattc tatagngtca cctaaatanc ttggcntaat catggtcnta 60 

r.ctgnczzcc tgrgtcaaat gtatacnaan tanatatgaa tctnarntga caaganngta 120 

rcntncatta gtaacaantg tnntgtccat cctgtcngan canattccca tnnattncgn 180 

cqcattcncn gcncantatn taatngggaa ntcnnntnnn ncaccnncat ctatcntncc 240 

gciijcctgac tggnagagat ggatnanttc tnntntgacc nacatgttca tcttggattn 300 

aananccccc cgcngnccac cggttngnng cnagccnntc ccaagacctc ctgtggaggt 360 

aacctgcgtc aganncatca aacntgggaa acccgcnncc angtnnaagt ngnnncanan 420 

garcccgtcc aggnttnacc atcccttcnc agcgccccct ttngtgcctt anagngnagc 480 

qtorccnanc cnctcaacat ganacgcgcc agnccanccg caattnggca caatgtcgnc 540 

jTioCccccra gggggantna tncaaanccc caggattgtc cncncangaa atcccncanc 600 

cccnccctac ccnnctttgg gacngtgacc aantcccgga gtnccagtcc ggccngnctc 660 

ccccaccggt nnccntgggg gggtgaanct cngnr.tcanc cngncgaggn ntcgnaagga 720 

riccggncctn ggncgaanng ancnntcnga agngccncnt cgtataaccc cccctcncca 780 

nccnacngnt agntcccccc cngggtncgg aangg 815 

<210> 25 

<211> 775 

<212> DNA 

<213> Homo sapien 

<220> 

<221> iuisc_f eature 
<222> (1) . . . (775) 

<223> n = A, T, C or G 

<400> 25 

ccgagatgtc tcgctccgtg gccttagctg tgctcgcgct actctctctt tctggcctgg 60 

aggctatcca gcgtactcca aagattcagg tttactcacg tcatccagca gagaatggaa 120 

agtcaaattt cctgaattgc tatgtgtctg ggtttcatcc atccgacatt gaanttgact 180 

tactgaagaa tgganagaga attgaaaaag tggagcattc agacttgtct ttcagcaagg 240 

actggtcttt ctatctcntg tactacactg aattcacccc cactgaaaaa gatgagtatg 300 

cctgccgtgt gaaccatgtg actttgtcac agcccaagat agttaagtgg gatcgagaca 360 

1 qtaagcagn cnncatqgaa gtttgaagat gccgcatttg gattggatga attccaaatt 420 

ctgcttgctt gcnttttaat antgatatgc ntatacaccc taccctttat gnccccaaat 480 

tgtaggggtt acatnantgt tcncntngga catgatcttc ctttataant ccnccnttcg 540 

aattgcccgt cncccngttn ngaatgtttc cnnaaccacg gttggctccc ccaggtcncc 600 

Lei t acggaa gggcctgggc cnctttncaa ggttggggga accnaaaatt tcncttntgc 660 

ccncccncca cnntcttgng nncncanttt ggaacccttc cnattcccct tggcctcnna 720 

nccttnncta anaaaacttn aaancgtngc naaanntttn acttcccccc ttacc 775 

<210> 26 

<211> 820 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc__f eature 
<222> (1) . . . (820) 

<223> n = A,T,C or G 

<400> 26 

unattancac agtgtaatct tttcccagag gtgtgtanag ggaacggggc ctagaggcat 60 

cccanagata ncttatanca acagtgcttt gaccaagagc tgctgggcac atttcctgca 120 

gaaaaggtgg cggtccccat cactcctcct ctcccatagc catcccagag gggtgagtag 180 

ccatcangcc ttcggtggga gggagtcang gaaacaacan accacagagc anacagacca 240 

■iU':icit.gacca tgggcgggag cgagcctctt ccctgnaccg gggtggcana nganagccta 300 

nctgaggggt cacactataa acgttaacga ccnagatnan cacctgcttc aagtgcaccc 360 
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tt cctacctg 
acnnagcact 
ccctgttgga 
gatggaattt 
tccctctntt 
ganattccac 
gggnncctcg 
tccaaccntc 



acnaccagng 
cacctgcccc 
attncgggga 
tncccttccg 
ntcctgncnc 
tnncgcctnc 
ntcatcctct 
gntggccntn 



accnnnaact 
cccatggccg 
naccaaggga 
gccnntcccc 
acttttnacc 
cntcnatcng 
ctttttcnct 
cccccccnnn 



gcngcctggg 
tncgcntccc 
nccccctcct 
tcttccttta 
ccnnnatttc 
naanacnaaa 
accnccnntt 
tcctttnccc 



gacagcnctg 
tggtcctgnc 
ccanctgtga 
cacgccccct 
ccttnattga 
nactntctna 
ctttgcctct 



ggancagcta 
aagggaagct 
aggaaaaann 
nntactcntc 
tcggannctn 
cccnggggat 
ccttngatca 



<210> 27 

<211> 818 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc__f eature 
<222> (1) . . . (818) 
<223> n = A,T,C or G 



<400 
tctgggtgat 
tgtttcttct 
ctgcggatgc 
ctgctgagca 
tccgcctcca 
ttcttcctgc 
gatctcagtt 
tatnaccnan 
nctcccttcc 
ctcctttgcc 
ctgntnnccc 
tnnctcttcn 



> 27 
ggcctcttcc 
ccgagcccca 
tgtgacggac 
ctt ccgcccc 
gggttctgct 
cccntccctg 
tccctcnctc 
tggnctgtnc 
anttcnnnna 
ctnaccangg 
cnctcncnnt 
ngtntcgnaa 



tcctcaggga 
ggcagcggtg 
ccaaggggca 
tcaccctgcc 
cttccangca 
gctctgantc 
anngaactct 
tgtcnnactt 
accngcttnc 
gccnnnaccg 
tncctcgtcc 
ngntcncntn 
ncngnncc cc 



cccnnccccc ngnattaagg cctccnntct 

<210> 28 

<211> 731 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> fl) . . . (731) 
<223> n - A, T,C or G 



cctctgactg 
attcagccct 
aatagggtcc 
cagcccctgc 
ngccancaag 
tctgtcttcc 
gtttctgann 
taatgggccn 
cntcntctcc 
cccntnnctn 
cnncnncgcn 
tnnnnngncn 
nx'irmciirinnn 
ccggccnc 



ctctgggcca 
gcccaacctg 
cagggtccag 
catgagctct 
tggcgctggg 
tgtcctgtgc 
tcttcantta 
gaccggctaa 
ccntancccg 

ggggggcnng 

nngcannttc 
ngntnntncn 
nggnnntnnn 



aagaatctct 
attctgatga 
ggaggggcgc 
gggctgggtc 
ccacactggc 
angcnccttg 
actntgantt 
tccctccctc 
ccngggaanc 
gtnnctncnc 
ncngtcccnn 
tccctctcnc 
tctncncngc 



<400 
aggaagggcg 
tcccaacatg 
gattnaaccc 
ntanattcct 
attnctcccg 
actaaagntt 
tnnnttncct 
nnngcgncnc 
cgtttcncat 
nggttcncct 
gnaatgggta 
tctcnacccc 



> 28 
gagggatatt 
anggtgnngt 
cattgtatgg 
gtnaatcgga 
ggtagtgcat 
naagtgggan 
tcgccctntg 
tgaaannnnc 
naaggcactt 
acgctnntng 
gggncttntc 
cccccttttt 



gtangggatt 
tctcttttga 
agnnaaaggn 
aaatnatntt 
nttngggggn 
tncaaatgaa 
actctgcnng 
tcgnggctnn 
tngcctcatc 
cncctnnntn 
ttttnaccnn 
caatcccanc 



gagggatagg 
angagggttg 
tttnagggat 
tcnncnggaa 
cngccangtt 
aacctnncac 
agcccaatac 
gancatcang 
caaccnctng 
ganattttnc 
gnggtntact 
ggcnaatggg 



agnataangg 
ngtttttann 
ttttcggctc 
aatnttgctc 
tcccaggctg 
agagnatccn 
ccnngngnat 
gggtttcgca 
ccctcnncca 
ccgcctnggg 
aatcnnctnc 
gtctccccnn 



gggaggtgtg 
ccnggtgggt 
ttatcagtat 
ccatccgnaa 
ctanaatcgt 
tacccgactg 
gtcncccngn 
tcaaaagcnn 
tttngccgtc 
naancctcct 
acgcntnctt 
cgangggggg 



420 
480 
540 
600 
660 
720 
780 
820 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
818 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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731 



<210> 


29 


<211> 


822 


<212> 


DNA 


<213> 


Homo 



sapien 

<220> 

<221> misc_feature 
<222> (!}... (822) 
<223> n = A, T, C or G 



<400 
actagt ccag 
^'^gctcanacc 
atntntacnc 
t nnctantct 
r cnccatntn 
tccatnantt 
t actctgact 
nt caacaacc 
ncaaataccc 
ccactggaat 
^atnctcctn 
t anatccctt 
ccnaatgaag 
canatcctat 



> 29 
t gtggt ggaa 
t cacancctc 
tcatanncct 
ntgccgcctn 
gcct ananta 
annntaacta 
cccacngcct 
tatctanctg 
nccacctgac 
cacnatngga 
naatt tactn 
ct t tcgaaaa 
gncncccaat 
cccttanttn 



t tccattgtg 
ccnacnangc 
cnnnacccac 
cnanccaccn 
ngtncatacc 
ccactgacnt 
annnattagc 
ttcnccaacc 
ncctaacccn 
naaaaaaaac 
ncantnccat 
ccnaccct 1 1 
cnangaaacg 
ggggnccctt 



ttggggncnc 
ctataangaa 
tccct cttaa 
gtgggccnac 
ctatacctac 
ngactttcnc 
ancntccccc 
nttncctccg 
caccatcccg 
ccnaactctc 
caancccacn 
annncccaac 
nccntgaaaa 
ncccngggcc 



ttctatgant 
nannaataga 
cccntactgt 
cncnngnatt 
nccaatgcta 
atnanctcct 
nacnatntct 
atccccnnac 
gcaagccnan 
tancncnnat 
tgaaacnnaa 
ct t tngggcc 
ancnaggcna 
cc 



antntt agat 
nctgtncnnt 
gcctatngcn 
ctcnatctcc 
nnnctaancn 
aatttgaatc 
caaccaaatc 
aacccccctc 
ggncat ttan 
ctccctaana 
cccctgtttt 
cccccnctnc 
anannntccg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
822 



<210> 30 

<211> 787 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_feature 
<222> (1) . . . (787) 



<223> n 



A, T, C or G 



<400 
cggccgcctg 
ctagagaaga 
gt ctgcagga 
gctggaagcc 
acaccagggg 
cccatggggc 
ggccgtggga 
tcccnttaat 
gtgaaattgt 
taaagcctgg 
ccgcttt ccn 
aaaagcggtt 
cqqtcgttnc 
cc:ccaaa 



> 30 
ctctggcaca 
cctt ctctcc 
tttgatgtct 
ctggagggcc 
ctccaggcag 
ctgnaaggcc 
tccactant t 
gaaggttaat 
ttntcccct c 
gggtngcctn 
ttcnggaaaa 
tgcnttttng 
nggtngcggg 



tgcctcctga 
tactgtcatt 
gaagt cgtgg 
tct ctcgcca 
cccatt attc 
agggtctcct 
ctanaacggn 
tgcncgcttg 
ncnatt ccnc 
nngaatnaac 
ctgtcntccc 
ggggntcctt 
gaangggnat 



atggcatcaa 
atggagccct 
agtgtggctt 
gcctccccct 
ccagnangac 
ttgacaccat 
cgccaccncg 
gcgtaatcat 
ncnacatacn 
tnaactcaat 
ct gcnttnnt 
ccncttcccc 
nnnctcccnc 



aagtgatgga 
gcagactgag 
ggagctcct c 
t ct ctccacg 
atgqtgtttc 
ctctcccgtc 
gtgggagctc 
nggtcanaac 
aacccggaan 
taattgcgtt 
gaatcggcca 
cctcnctaan 
naagggggng 



ctgcccattg 
ggctcccctt 
at ct acatna 
ctctccangg 
tccacgcgga 
ctgcctggca 
cagcttttgt 
tnttt cctgt 
cataaagtgt 
ggct catggc 
ccccccnggg 
ccctncgcct 
agnnngntat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
787 



<210> 31 

<211> 799 

<212> DNA 

<213> Homo sapien 
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<220> 

<221> iflISc feature 
<222> (1) .7. (799) 
<223> n = A,T,C or G 



<400> 31 

tttttttttt tttttttggc gatgctactg tttaattgca ggaggtgggg gtgtgtgtac 60 

catgtaccag ggctattaga agcaagaagg aaggagggag ggcagagcgc cctgctgagc 120 

aacaaaggac tcctgcagcc ttctctgtct gtctcttggc gcaggcacat ggggaggcct 180 

cccgcagggt gggggccacc agtccagggg tgggagcact acanggggtg ggagtgggtg 240 

gtggctggtn cnaatggcct gncacanatc cctacgattc ttgacacctg gatttcacca 300 

ggggaccttc tgttctccca nggnaacttc ntnnatctcn aaagaacaca actgtttctt 360 

cngcanttct ggclgttcat ggaaagcaca ggtgtccnat ttnggctggg acttggtaca 420 

tatggttccg qcccaoctot cccntcnaan aagtaattca cacccccccn ccntctnttg 430 

cctgggccct taantaccca caccggaact canttantta ttcatcttng gntgggcttg 540 

ntnatcnccn cctgaangcg ccaagttgaa aggccacgcc gtncccnctc cccatagnan 600 

nttttnncnt canctaatgc ccccccnggc aacnatccaa tccccccccn tgggggcccc 660 

agcccanggc ccccqnctcg ggnnnccngn cncgnantcc ccaggntctc ccantcngnc 720 

ccnnngcncc cccgcacgca gaacanaagg ntngagccnc cgcannnnnn nggtnncnac 780 

ctcgcccccc ccnncgnnq 799 



<210> 32 

<211> 789 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (789) 
<223> n = A,T,C or G 



tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt 60 

ttttnccnag ggcaggttta ttgacaacct cncgggacac aancaggctg gggacaggac 120 

ggcaacaggc tccggcggcg gcggcggcgg ccctacctgc ggtaccaaat ntgcagcctc 180 

cgctcccgct tgatnttcct ctgcagctgc aggatgccnt aaaacagggc ctcggccntn 24 0 

ggtgggcacc ctgggatttn aatttccacg ggcacaatgc ggtcgcancc cctcaccacc 300 

nattaggaat agtggtntta cccnccnccg ttggcncact ccccntggaa accacttntc 360 

gcggctccgg catctggtct taaaccttgc aaacnctggg gccctctttt tggttantnt 420 

nccngccaca atcatnactc agactggcnc gggctggccc caaaaaancn ccccaaaacc 4 80 

ggnccatgtc ttnncggggt tgctgcnatn tncatcacct cccgggcnca ncaggncaac 54 0 

ccaaaagttc ttgnggcccn caaaaaanct ccggggggnc ccagtttcaa caaagtcatc 600 

ccccttggcc cccaaatcct ccccccgntt nctgggtttg ggaacccacg cctctnnctt 660 

tggnnggcaa gntggntccc ccttcgggcc cccggtgggc ccnnctctaa ngaaaacncc 720 

ntcctnnnca ccatcccccc nngnnacgnc tancaangna tccctttttt tanaaacggg 780 

ccccccncg 7g9 



<210> 33 

<211> 793 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (793) 
<223> n = A, C or G 
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<400> 33 

qacagaacat gttggatggt ggagcacctt tctatacgac ttacaggaca gcagatgggg 60 

Tjattcatggc tgrtggagca atanaacccc agttctacga gctgctgatc aaaggacttg 120 

aaCLaaagtc tgatgaactt cccaatcaga tgagcatgga tgattggcca gaaatgaana 180 

agaagtttgc agatgtattt gcaaagaaga cgaaggcaga gtggtgtcaa atctttgacg 240 

gcacagatgc ctgtgtgact ccggttctga cttttgagga ggttgttcat catgatcaca 300 

acaangaacg gggctcgttt atcaccantg aggagcagga cgtgagcccc cgccctgcac 360 

::tct.qctqtt aaacacccca gccatccctt ctttcaaaag ggatccacta cttctagagc 420 

ggncgccacc gcggtggagc tccagctttt gttcccttta gtgagggtta attgcgcgct 480 

tqqcgtaatc atggtcatan ctgtttcctg tgtgaaattg ttatccgctc acaattccac 540 

.iCciricatacg anccggaagc atnaaatttt aaagcctggn ggtngcctaa tgantgaact 600 

nactcacatt aattggcttt gcgctcactg cccgctttcc agtccggaaa acctgtcctt 660 

gccaqctgcc nttaatgaat cnggccaccc cccggggaaa aggcngtttg cttnttgggg 720 

cgcncttccc gctttctcgc ttcctgaant ccttcccccc ggtctttcgg cttgcggcna 780 

ac'qqtalcna cct 7 93 



<210> 34 

<211> 756 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> {!)... (756) 
<223> n - A, T,C or G 



<400> 34 

qccgcgaccg gcatgtacga gcaactcaag ggcgagtgga accgtaaaag ccccaatctt 60 

.'iiicaagtgcg gggaanagct gggtcgactc aagctagttc ttctggagct caacttcttg 120 

ccaaccacag ggaccaagct gaccaaacag cagctaattc tggcccgtga catactggag 180 

atcggggccc aatggagcat cctacgcaan gacatcccct ccttcgagcg ctacatggcc 240 

cagctcaaat gctactactt tgattacaan gagcagctcc ccgagtcagc ctatatgcac 300 

■.:ar;::t cttgg gcctcaacct cctcttcctg ctgtcccaga accgggtggc tgantnccac 360 

:v;qqdnttqg ancggctgcc tgcccaanga catacanacc aatgtctaca tcnaccacca 420 

gtgtcctgga gcaatactga tgganggcag ctaccncaaa gtnttcctgg ccnagggtaa 480 

catcccccgc cgagagctac accttcttca ttgacatcct gctcgacact atcagggatg 540 

'1. jc3 3tcgcng ggttgctcca gaaaggctnc aanaanatcc ttttcnctga aggcccccgg 600 

.it I. q-iCtagt nctagaatcg gcccgccatc gcggtgganc ctccaacctt tcgttnccct 660 

Liactgaggg ttnattgccg cccttggcgt tatcatggtc acnccngttn cctgtgttga 720 

aattnttaac cccccacaat tccacgccna cattng 756 



<210> 35 

<211> 834 

<212> DNA 

v213> Homo sapien 

<220> 

<221> misc_f eature 
<222> CD . . . (834) 
<223> n = A, T, C or G 



<400> 35 

qc^vfqalctct anatcnacct gnatgcatgg ttgtcggtgt ggtcgctgtc gatgaanatg 60 

.-jacaqqatct tgcccttgaa gctctcggct gctgtnttta agttgctcag tctgccgtca 120 

■^"antcagaca cnctcttggg caaaaaacan caggatntga gtcttgattt cacctccaat 180 

ciri I tcngg gctgtctgct cggtgaactc gatgacnang ggcagctggt tgtgtntgat 240 

'^aantccanc angttctcct tggtgacctc cccttcaaag ttgttccggc cttcatcaaa 300 

■:jt.. tctnnaan angannancc canctttgtc gagctggnat ttgganaaca cgtcactgtt 360 
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ggaaactgat 
ggcncaaatc 
nncaangact 
ttcttcagcc 
ggaanccgtc 
acntnctggg 
nccnaacttt 
gctnttggcc 



cccaaatggt 
cgact ccccn 
ctnccgctnc 
agttcacnat 
tctcccttcc 
ccgggttcaa 
ttccttcccc 
antcccctgg 



atgtcatcca 
tccttgaaag 
cccntccnng 
nttcatcagc 
tgaannaact 
antccctccn 
cnccccncgg 
gggcntntan 



tcgcctctgc 
aagccnatca 
cagggttggt 
ccctctgcca 
ttgaccgtng 
ttgncnntcn 
ngtttggntt 
cnccccctnt 



tgcctgcaaa 
cacccccctc 
ggcannccgg 
gctgttntat 
gaatagccgc 
cctcgggcca 
tttcatnggg 
ggtcccntng 



aaacttgctt 
cctggactcc 
gcccntgcgc 
tccttggggg 
gcntcnccnt 
ttctggattt 
ccccaactct 
ggcc 



<210> 36 

<211> 814 

<212> DNA 

<213> Homo sapien 

<220> 

<221> inisc_f eature 
<222> (1) . . . (814) 
<223> n = A,T,C or G 



<400 
cggncgcttt 
cctagnaaac 
naacgccaac 
ggaaaggcct 
aatggaaaaa 
ctaaaacanc 
ggcttgatgg 
antganctgg 

sggggsngtc 

gcccctgaac 
cttccggtct 
tgtnttggac 



> 36 
ccngccgcgc 
attaatgggt 
tcaggccatt 
gccttgtaag 
aaaaataaac 
ccagcgctca 
tatcactgcc 
aaggcctgaa 
ntttncagtg 
ganatgcttc 
gatccnaaag 
ccntgctngn 



ggngaactca agaaggtctn 



cccgtttcca 
tgctctacta 
cctaccaaag 
acaccacaat 
aanaggtttt 
cttctgcttg 
acntttccac 
ncttagtctc 
gatctgccaa 
cancancctt 
gaatgttcct 
atnacccaan 
ot gcoct aOii 
ngaaaaacca 



tgacnaaggc 
atacatcata 
gaagaaaggc 
ncggctgaat 
gttctcatgg 
ganaaatatt 
ccagctgggc 
caaaagtctc 
anantacccn 
taagacccat 
gggtcccant 
tganatcccc 
nctgaaagca 
cncn 



tcccttcang 
cnaaccagta 
tggtctctcc 
ctnaagtctt 
ctgcccaccg 
ctttgctctt 
ncccttcccc 
ngcccacaag 
tatcatcnnt 
aatcctngaa 
ccctcctttg 
ngaagcaccc 



ttaaatacnn 
agcctgccca 
accccctgta 
gtgttttact 
cagcctggca 
ttggacatca 
catntttgtc 
accggccacc 
gaataaaaag 
ccatggtgcc 
ttncttacgt 
tncccctggc 



cna i_ uCv_,c i,n gg\_^riucriac±n 



<210> 37 

<211> 760 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (760) 
<223> n = A,T,C or G 



<400 
gcatgctgct 
gcgcagtgtt 
gtgtctggca 
tcnaanccac 
gtgtcgtcac 
gggctgacag 
cncctnancc 
actcttcttc 
ttgcaaaatc 
ganccncctt 
caattgaact 
actggaaaaa 



> 37 
cttcctcaaa 
cgctgaaggg 
ggtccacgca 
tcgtgtattt 
actccactaa 
gtgccagaac 
caaactgcct 
ccaaaggtag 
tgctccgtgg 
gtttgaatgc 
gttaacnttg 
ggtangtgcc 



gttgttcttg 
gttgtagtac 
atgccctttg 
ttcacangca 
actgtcgatn 
acactggatn 
ctcaaaggcc 
ttgttcttgt 
gggtcatnnn 
naaggnaata 
ggccgngttc 
ttccttgaat 



ttgccataac 
cagcgcggga 
tcactgggga 
gcctcctccg 
cancagccca 
ggcctttcca 
accttgcaca 
tgcccaagca 
taccanggtt 
atcctcctgt 
cnctngggtg 
tcccaaantt 



aaccaccata 
tgctctcctt 
aatggatgcg 
aagcntccgg 
ttgctgcagc 
tggaagggcc 
ccccgacagg 
ncctccanca 
ggggaaanaa 
cttgcttggg 
gtctgaaact 
cccctngntt 



ggtaaagcgg 
gcagagtcct 
ctggagctcg 
gcagttgggg 
ggaactgggt 
tgggggaaat 
ctagaaatgc 
aaccaaaanc 
acccggcngn 
tggaanagca 
aatcaccgtc 
tgggtnnttt 



420 
480 
540 
600 
660 
720 
780 
834 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
814 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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ci.cc\.ctncc ctaaaaatcg rnttcccccc ccntanggcg 760 

<2iC> 38 

<211> 724 

<212> OKA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> ( 1) . . . (724 ) 
<223> n = A, T, C or G 



<400> 38 

tttttttttt tttttttttt tttttttttt tttttaaaaa ccccctccat tgaatgaaaa 60 

cttccnaaat tgtccaaccc cctcnnccaa atnnccattt ccgggggggg gttccaaacc 120 

caaattaatt ttgganttta aattaaatnt tnattngggg aanaanccaa atgtnaagaa 180 

aatttaaccc attatnaact taaatncctn gaaacccntg gnttccaaaa atttttaacc 240 

cttaaatccc tccgaaattg ntaanggaaa accaaattcn cctaaggctn tttgaaggtt 300 

ngatttaaac ccccttnant tnttttnacc cnngnctnaa ntatttngnt tccggtgttt 360 

tcctnttaan cntnggtaac tcccgntaat gaannnccct aanccaatta aaccgaattt 420 

tttttgaatt ggaaattccn ngggaattna ccggggtttt tcccntttgg gggccatncc 480 

cccnctttcg gggtttgggn ntaggttgaa tttttnnang ncccaaaaaa ncccccaana 540 

aaaaaactcc caagnnttaa ttngaatntc ccccttccca ggccttttgg gaaaggnggg 600 

Ittntggggg ccngggantt cnttcccccn ttnccncccc ccccccnggt aaanggttat 660 

ngnntttggt ttttgggccc cttnanggac cttccggatn gaaattaaat ccccgggncg 720 

gccg '724 



<210> 39 

<211> 751 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (751) 
<223> n ^ A,T,C or G 



<400> 39 

tttttttttt tttttctttg ctcacattta atttttattt tgattttttt taatgctgca 60 

caacacaata tttatttcat ttgtttcttt tatttcattt tatttgtttg ctgctgctgt 120 

tttatttatt tttactgaaa gtgagaggga acttttgtgg ccttttttcc tttttctgta 180 

ggccgcctta agctttctaa atttggaaca tctaagcaag ctgaanggaa aagggggttt 24 0 

cgcaaaatca ctcgggggaa nggaaaggtt gctttgttaa tcatgcccta tggtgggtga 300 

ttaactgctt gtacaattac ntttcacttt taattaattg tgctnaangc tttaattana 360 

cttgggggtt ccctccccan accaaccccn ctgacaaaaa gtgccngccc tcaaatnatg 420 

tcccggcnnt cnttgaaaca cacngcngaa ngttctcatt ntccccncnc caggtnaaaa 4 80 

tgaagggtta ccatntttaa cnccacctcc acntggcnnn gcctgaatcc tcnaaaancn 540 

ccctcaancn aattnctnng ccccggtcnc gcntnngtcc cncccgggct ccgggaantn 600 

cacccccnga anncnntnnc naacnaaatt ccgaaaatat tcccnntcnc tcaattcccc 660 

cnnagactnt cctcnncnan cncaattttc ttttnntcac gaacncgnnc cnnaaaatgn 720 

nnnncncctc cnctngtccn naatcnccan c 751 



<210> 40 

<211> 753 

<212> DNA 

<213> Homo sapien 

<220> 
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<221> inisc_f eature 
<222> (1) . . . (753) 
<223> n = A,T,C or G 



<400> 40 

gtggtatttt ctgtaagatc aggtgttcct 
agatgaaaac ccccccgaga cagcagcact 
cgccctatgc acagctgggc ccttgagaca 
tggtctggaa gcggcggctg tacctgcgta 
tctcaaagtt ccaggcaacn tcgttgcgac 
cggtcataan cgcggtggcg tcgtcgctgg 
ataaaaggtg cgcccccgca ccgttcanct 
cnaacccacc accannccgg acttccttga 
ttctnctgat gccctanctg gttgcccngn 
aaancacccn cctcctcntt tcatctgggt 
ggancccata tctcnaccan tactcaccnt 
ttcccncccg ncctctggcc cntcaaanan 
tnccctatct gnaccccncn tttgtctcan 

<210> 41 

<211> 341 

<212> DNA 

<213> Homo sapien 



ccctcgtagg 
gcaactgcca 
gcagggcttc 
ggggcacacc 
acaccggaga 
gagctggcag 
cgcacttctc 
nggaattccc 
atgccaanca 
tnttntcccc 
ncccccccnt 
gcttncacna 
tnt 



tttagaggaa 
agcagccggg 
gatgtcaggc 
gtcagggccc 
ccaggtgatn 
ggcctcccgc 
naanaccatg 
aaatctcttc 
nccccaancc 
ggaccntggt 
gnnacccanc 
cctgggtctg 



acaccctcat 
gtaggagggg 
tcgatgtcaa 
accaggaact 
agcttggggt 
aggaaggcna 
angttgggct 
gntcttgggc 
ccggggtcct 
tcctctcaag 
cttctanngn 
ccttcccccc 



cccatagact 
atgttctcag 
aaaataggaa 
gtctacattc 
ttgagaatat 
attagggtag 



tcttgacata 
tattttggga 
tttgtgaaga 
aatccagaca 
tctttcagag 
t 



gcttcaaatg 
gcctttccac 
gttaaaaagt 
cttagttgag 
gtattttcat 



<400> 41 

actatatcca tcacaacaga catgcttcat 
agtgaaccca tccttgattt atatacatat 
ttctttaaac cttgttcatt atgaacactg 
tatagcttgt ttacgtagta agtttttgaa 
tgttaaactg tgatttttaa aaaatatcat 
ttttactttt tgattaattg tgttttatat 

<210> 42 

<211> 101 

<212> DNA 

<213> Homo sapien 

<400> 42 

acttactgaa tttagttctg tgctcttcct tatttagtgt tgtatcataa atactttgat 
gtttcaaaca ttctaaataa ataattttca gtggcttcat a 

<210> 43 

<211> 305 

<212> DNA 

<213> Homo sapien 

<400> 43 

acatctttgt tacagtctaa gatgtgttct taaatcacca ttccttcctg gtcctcaccc 
tccagggtgg tctcacactg taattagagc tattgaggag tctttacagc aaattaagat 
tcagatgcct tgctaagtct agagttctag agttatgttt cagaaagtct aagaaaccca 
cctcttgaga ggtcagtaaa gaggacttaa tatttcatat ctacaaaatg accacaggat 
tggatacaga acgagagtta tcctggataa ctcagagctg agtacctgcc cgggggccgc 
tcgaa 

<210> 44 

<211> 852 

<212> DNA 

<213> Homo sapien 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
753 



60 
120 
180 
240 
300 
341 



60 
101 



60 
120 
180 
240 
300 
305 
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<220> 

<221> mis cofeature 
<222> (1) . . . (852) 
<223> n =- A,T,C or G 



<4CC> 44 

acataaatat caqagaaaag tagtctttga aatatttacq tccaggagtt ctttgtttct 60 

gattatttgg tgtgtgtT:tt ggtttgtgtc caaagtattg gcagcttcag ttttcatttr 120 

ctctccatcc tcgggcattc ttcccaaatt tatataccag tcttcgtcca tccacacgct 180 

-cagaatttc tcttttgtag taatatctca tagctcggct gagcttttca taggtcatgc 240 

tgctgttgtt cttcttttta ccccatagct gagccactgc ctctgatttc aagaacctga 300 

agacgccctc agatcggtct tcccatttta ttaatcctgg gttcttgtct gggttcaaga 360 

ggatgtcgcg gatgaattcc cataagtgag tccctctcgg gttgtgcttt ttggtgtggc 420 

acttggcagg ggggtcttgc tcctttttca tatcaggtga ctctgcaaca ggaaggtgac 480 

tggtggttgt catggagatc tgagcccggc agaaagtttt gctgtccaac aaatctactg 540 

tgctaccata gttggtgtca tataaatagt tctngtcttt ccaggtgttc atgatggaag 600 

gctcagtttg ttcagtcttg acaatgacat tgtgtgtgga ctggaacagg tcactactgc 660 

actggccgtt ccacttcaga tgctgcaagt tgctgtagag gagntgcccc gccgtccctg 720 

ccgcccgggt gaactcctgc aaactcatgc tgcaaaggtg ctcgccgttg atgtcqaact 780 

cntggaaagg gatacaattg gcatccagct ggttggtgtc caggaggtga tggagccact 840 

cccacacctg gt 852 



<2a0> 45 

<211> 234 

<212> DNA 

<213> Homo sapien 



<400> 45 

acaacagacc cttgctcgct aacgacctca tgctcatcaa gttggacgaa tccgtgtccg 60 

agtctgacac catccggagc atcagcattg cttcgcagtg ccctaccgcg gggaactctt 120 

gcctcgtttc tggctggggt ctgctggcga acggcagaat gcctaccgtg ctgcagtgcg 180 

t aaacgtgtc ggtggtgtct gaggaggtct gcagtaagct ctatgacccg ctgt 234 



<210> 46 

<211> 590 

<212> DNA 

<213> Homo sapien 

<220> 

<221> niisc_f eature 
<222> (1) . . . (590) 
<223> n - A, T, C or G 



<400> 46 

actttttatt taaatgttta taaggcagat ctatgagaat gatagaaaac atggtgtgta 60 

atttgatagc aatattttgg agattacaga gttttagtaa ttaccaatta cacagttaaa 120 

aagaagataa tatattccaa gcanatacaa aatatctaat gaaagatcaa ggcaggaaaa 180 

tgantataac taattgacaa tggaaaatca attttaatgt gaattgcaca ttatccttta 240 

aaagctttca aaanaaanaa ttattgcagt ctanttaatt caaacagtgt taaatggtat 300 

caggataaan aactgaaggg canaaagaat taattttcac ttcatgtaac ncacccanat 360 

ttacaatggc ttaaatgcan ggaaaaagca gtggaagtag ggaagtantc aaggtctttc 420 

1 qgtctctaa tctgccttac tctttgggtg tggctttgat cctctggaga cagctgccag 480 

ggctcctgtt atatccacaa tcccagcagc aagatgaagg gatgaaaaag gacacatgct 540 

gccttccttt gaggagactt catctcactg gccaacactc agtcacatgt 590 



<210> 47 
<211> 774 



w o 01/51633 



PCT/IIS()I/01574 



<212> DNA 

<213> Homo sapien 

<220> 

<221> mis cofeature 
<222> {!).., (774) 
<223> n = A, T,C or G 



<400> 47 

acaagggggc ataatgaagg agtggggana gattttaaag aaggaaaaaa aacgaggccc 60 

tgaacagaat tttcctgnac aacggggctt caaaataatt ttcttgggga ggttcaagac 120 

gcttcactgc ttgaaactta aatggatgtg ggacanaatt ttctgtaatg accctgaggg 180 

cattacagac gggactctgg gaggaaggat aaacagaaag gggacaaagg ctaatcccaa 24 0 

aacatcaaag aaaggaaggt ggcgtcatac ctcccagcct acacagttct ccagggctct 300 

cctcatccct ggaggacgac agtggaggaa caactgacca tgtccccagg ctcctgtgtg 360 

ctggctcctg gtcttcagcc cccagctctg gaagcccacc ctctgctgat cctgcgtggc 420 

ccacactcct tgaacacaca tccccaggtt atattcctgg acatggctga acctcctatt 480 

cctacttccg agatgccttg ctccctgcag cctgtcaaaa tcccactcac cctccaaacc 540 

acggcatggg aagcctttct gacttgcctg attactccag catcttggaa caatccctga 600 

ttccccactc cttagaggca agatagggtg gttaagagta gggctggacc acttggagcc 660 

aggctgctgg cttcaaattn tggctcattt acgagctatg ggaccttggg caagtnatct 720 

tcacttctat gggcntcatt ttgttctacc tgcaaaatgg gggataataa tagt 774 



<210> 48 

<211> 124 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (124) 
<223> n - A, T,C or G 



<400> 48 

canaaattga aattttataa aaaggcattt ttctcttata tccataaaat gatataattt 60 

ttgcaantat anaaatgtgt cataaattat aatgttcctt aattacagct caacgcaact 120 

tggt 3^24 



<210> 49 

<211> 147 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (147) 
<223> n = A, T;.C or G 



<400> 49 

gccgatgcta ctattttatt gcaggaggtg ggggtgtttt tattattctc tcaacagctt 60 

tgtggctaca ggtggtgtct gactgcatna aaaanttttt tacgggtgat tgcaaaaatt 120 

ttagggcacc catatcccaa gcantgt 147 



<210> 50 

<211> 107 

<212> DNA 

<213> Homo sapien 
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<4C0> 50 

acattaaatt aataaaagga ctgttgggqt rctgctaaaa cacatggctt gatatattgc 60 
gat. 1- tgag cttaggagga gctaggcara tgrtttggga gaggggt 107 

<210> 51 

<-211> 204 

<212> DNA 

<213> Homo sapien 



<400> 51 

:]t cctaggaa gtctagggga cacacgactc tggggtcacg gggccgacac acttgcacgg 60 

jgggaaggaa aggcagagaa gtgacaccgt cagggggaaa tgacagaaag gaaaatcaag 120 

gccttgcaag gtcagaaagg ggactcaggg cttccaccac agccctgccc cacttggcca 180 

cci -^^cct ttt gggaccagca atgt 204 



<210> 52 

<211> 491 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) ... (491) 
<223> n = A, T, C or G 



<400> 52 

acaaagataa catttatctt ataacaaaaa tttgatagtt ttaaaggtta gtattgtgta 60 

gggtattttc caaaagacta aagagataac tcaggtaaaa agttagaaat gtataaaaca 120 

ccatcagaca ggtttttaaa aaacaacata ttacaaaatt agacaatcat ccttaaaaaa 180 

aaaacttctt gtatcaattt cttttgttca aaatgactga cttaantatt tttaaatatt 240 

tcanaaacac ttcctcaaaa attttcaana tggtagcttt canatgtncc ctcagtccca 300 

ataltgctca gataaataaa tctcgtgaga acttaccacc caccacaagc tttctggggc 360 

'rr..iacagt gtcttttctt tnctttttct tttttttttt ttacaggcac agaaactcat 420 

r'\-.,4-t t ttatt tggataacaa agggtctcca aattatattg aaaaataaat ccaagttaat 480 

atcactcttg t 491 



<210> 53 

<211> 484 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1 ) ... (484 ) 
<223> n = A, T, C or G 



<400> 53 

acataattta gcagggctaa ttaccataag atgctattta ttaanaggtn tatgatctga 60 

gtattaacag ttgctgaagt ttggtatttt tatgcagcat tttctttttg ctttgataac 120 

actacagaac ccttaaggac actgaaaatt agtaagtaaa gttcagaaac attagctgct 180 

caatcaaatc tctacataac actatagtaa ttaaaacgtt aaaaaaaagt gttgaaatct 240 

qcactagtat anaccgctcc tgtcaggata anactgcttt ggaacagaaa gggaaaaanc 300 

agctttgant ttctttgtgc tgatangagg aaaggctgaa ttaccttgtt gcctctccct 360 

aatgattggc aggtcnggta aatnccaaaa catattccaa ctcaacactt cttttccncg 420 

^^ancttgant ctgtgtattc caggancagg cggatggaat gggccagccc ncggatgttc 480 

r-ant 4 84 



<210> 54 
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<211> 151 
<212> DNA 
<213> Homo sapien 

<400> 54 

actaaacctc gtgcttgtga actccataca gaaaacggtg ccatccctga acacggctgg 60 

ccactgggta tactgctgac aaccgcaaca acaaaaacac aaatccttgg cactggctag 120 

tctatgtcct ctcaagtgcc tttttgtttg t 151 

<210> 55 

<211> 91 

<212> DNA 

<213> Homo sapien 

<400> 55 

acctggcttg tctccgggtg gttcccggcg ccccccacgg tccccagaac ggacactttc 60 
gccctccagt ggatactcga gccaaagtgg t 91 

<210> 56 

<211> 133 

<212> DNA 

<213> Homo sapien 

<400> 56 

ggcggatgtg cgttggttat atacaaatat gtcattttat gtaagggact tgagtatact 60 

tggatttttg gtatctgtgg gttgggggga cggtccagga accaataccc catggatacc 120 

aagggacaac tgt 133 

<210> 57 

<211> 147 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (147) 
<223> n - A,T,C or G 



<400> 57 

actctggaga acctgagccg ctgctccgcc tctgggatga ggtgatgcan gcngtggcgc 60 

gactgggagc tgagcccttc cctttgcgcc tgcctcagag gattgttgcc gacntgcana 120 

tctcantggg ctggatncat gcagggt 147 



<210> 58 

<211> 198 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (198) 
<223> n = A, T,C or G 



<400> 58 

acagggatat aggtttnaag ttattgtnat tgtaaaatac attgaatttt ctgtatactc 60 

tgattacata catttatcct ttaaaaaaga tgtaaatctt aatttttatg ccatctatta 120 

atttaccaat gagttacctt gtaaatgaga agtcatgata gcactgaatt ttaactagtt 180 

ttgacttcta agtttggt 198 
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<2lO> 59 

<211> 330 

<212> DNA 

<213> Hon:io sapien 

<400> 59 

^^ciucaaatg ggttgtgagg aagtcttatc agcaaaactg gtgatggcta ctgaaaagat 60 

ccattgaaaa ttatcattaa tgattttaaa tgacaagtta tcaaaaactc actcaatttt 120 

cacctgtgct agcttgctaa aatgggagtt aactctagag caaatatagt atcttctgaa 180 

f o-^.igtcaat aaatgacaaa gccagggcct acaggtggtt tccagacttt ccagacccag 240 

::agdaggaat ctattttatc acatggatct ccgtctgtgc tcaaaatacc taatgatatt 300 

tttcgtcttt attggacttc tttgaagagt 330 

<210> 60 

<211> 175 

<212> DNA 

<213> Homo sapien 

<400> 60 

accgtgggtg ccttctacat tcctgacggc tccttcacca acatctggtt ctacttcggc 60 

gtcqtgggct ccttcctctt catcctcatc cagctggtgc tgctcatcga ctttgcgcac 120 

Icr^tqgaacc agcggtggct gggcaaggcc gaggagtgcg attcccgtgc ctggt 175 

<210> 61 

<211> 154 

<212> DNA 

<213> Homo sapien 



<400> 61 

accccacttt tcctcctgtg agcagtctgg acttctcact gctacatgat gagggtgagt 60 

qqi.tgttgct cttcaacagt atcctcccct ttccggatct gctgagccgg acagcagtgc 120 

rqqnr-tgcac agccccgggg ctccacattg ctgt 154 



<210> 62 

<211> 30 

<:212> DNA 

<213> Homo sapien 

<400> 62 

cgctcgagcc ctatagtgag tcgtattaga 30 

<210> 63 

<211> 89 

<212> DNA 

<213> Homo sapien 

<400> 63 

dcaagtcatt tcagcaccct ttgctcttca aaactgacca tcttttatat ttaatgcttc 60 
ctgtatgaat aaaaatggtt atgtcaagt 89 

<210> 64 

<211> 97 

<212> DNA 

<213> Homo sapien 

<400> 64 

acoggagtaa ctgagtcggg acgctgaatc tgaatccacc aataaataaa ggttctgcag 60 
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aatcagtgca tccaggattg gtccttggat ctggggt 



97 



<210> 65 

<211> 377 

<212> DNA 

<213> Homo sapien 

<220> 

<221> inisc_f eature 
<222> (1) . . . (377) 
<223> n - A,T,C or G 



<400> 65 

acaacaanaa ntcccttctt taggccactg 
gcatggcgtc ctaggccttg acacagcggc 
ccaaccctgg tctacccaca nttctggcta 
tcggtcataa natgaaatcc caanggggac 
ggtgctgttt gctcagccag aaaacagctg 
tgggggtgaa ctacccccan gaggaatcat 
gggcgggagg agcatgt 

<210> 66 

<211> 305 

<212> DNA 

<213> Homo sapien 

<400> 66 

acgcctttcc ctcagaattc agggaagaga 
agaacccgtg tgccccttcc caccatatcc 
aggaactaac tgcaccctgg tcctctcccc 
tcctccactc taagggatat caacactgcc 
ttatatattt tttaataaga tgcactttat 
tgttt 

<210> 67 

<211> 385 

<212> DNA 

<213> Homo sapien 



atggaaacct 
tggggtttgg 
tgggctgtct 
agaggtcagt 
cctggcattc 
gcctgggcga 



ggaaccccct 
gctntcccaa 
ctgccactga 
agaggaagct 
gccgctgaac 
tgcaanggtg 



tttgatggca 
accgcacacc 
acatcagggt 
caatgagaaa 
tatgaacccg 
ccaacaggag 



ctgtcgcctg ccttcctccg 
accctcgctc catctttgaa 
agtccccagt tcaccctcca 
cagcacaggg gccctgaatt 
gtcatttttt aataaagtct 



ttgttgcgtg 
ctcaaacacg 
tccctcacct 
tatgtggttt 
gaagaattac 



60 
120 
180 
240 
300 
360 
377 



60 
120 
180 
240 
300 
305 



<400 
actacacaca 
ggtcggacca 
cccttttaaa 
tgtgctgtgc 
ctgggcagtc 
cctctcccag 
catagtttct 



> 67 
ctccacttgc 
gccacat etc 
aaaggggact 
tggagattca 
ttgcacatga 
ggccccagcc 
gtgctagtgg 



cct tgtgaga 
atgtgcaaga 
tgcttaaaaa 
cttttgagag 
gatggggctg 
tggccacacc 
accgt 



cactttgtcc 
ttgcccagca 
agaagtctag 
agttctcctc 
gtctgatctc 
tgcttacagg 



cagcacttta 
gacatcaggt 
ccacgattgt 
tgagacctga 
agcactcctt 
gcactctcag 



ggaatgctga 
ctgagagttc 
gtagagcagc 
tctttagagg 
agtctgcttg 
atgcccatac 



<210> 68 

<211> 73 

<212> DNA 

<213> Homo sapien 

<400> 68 

acttaaccag atatattttt accccagatg gggatattct ttgtaaaaaa tgaaaataaa 
gtttttttaa tgg 



60 
120 
180 
240 
300 
360 
385 



60 
73 



<210> 69 
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<211> 536 
<212> DNA 
<213> Home sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (536) 
<223> a - A,T,C or G 



<400> 69 

-.■^•V.agt ccag tgtggtggaa ttccattgtg ttgggggctc tcaccctcct ctcctgcagc 60 

tccagctttg tgctctgcct ctgaggagac catggcccag catctgagta ccctgctgct 120 

cctgctqgcc accctagctg tggccctggc ctggagcccc aaggaggagg ataggataat 180 

,::ccgggtggc ^tctataacg cagacctcaa tgatgagtgg gtacagcgtg cccttcactt 240 

cgccatcagc qagtataaca aggccaccaa agatgactac tacagacgtc cgctgcgggt 300 

actaagagcc aggcaacaga ccgttggggg ggtgaattac ttcttcgacg tagaggtggg 360 

ccgaaccata Igtaccaagt cccagcccaa cttggacacc tgtgccttcc atgaacagcc 420 

agaactgcaa aaaaaacagt tgtgctcttt cgagatctac gaagttccct ggggagaaca 480 

qaangtccct: qg.itg^a^tc caggtgtcaa gaaatcctan ggatctgttg ccaggc 536 



<210> 70 

<211> 4 77 

<212> DNA 

<213> Hono i^apien 



<400> 70 

atgaccccta acaggggccc tctcagccct cctaatgacc tccggcctag ccatgtgatt 60 

tcacttccac tccataacgc tcctcatact aggcctacta accaacacac taaccatata 120 

ccaatgatgg cgcgatgtaa cacgagaaag cacataccaa ggccaccaca caccacctgt 180 

ccaaaaaggc cttcgatacg ggataatcct atttattacc tcagaagttt ttttcttcgc 240 

agggattttt ctgagccttt taccactcca gcctagcccc taccccccaa ctaggagggc 300 

actggccccc aacaggcatc accccgctaa atcccctaga agtcccactc ctaaacacat 360 

ccgtattact cgcatcagga gtatcaatca cctgagctca ccatagtcta atagaaaaca 420 

.:iccgaaacca aattattcaa agcactgctt attacaattt tactgggtct ctatttt 477 



<210> 71 

<211> 533 

<212> DNA 

<213> Homo sapien 

<220> 

<221> inisc_f eature 
<222> (1) . . . (533) 
<223> n = A, T, C or G 



<400> 71 

agagctatag gtacagtgtg atctcagctt tgcaaacaca ttttctacat agatagtact 60 

aggtattaat agatatgtaa agaaagaaat cacaccatta ataatggtaa gattggttta 120 

tgtgatttta gtggtatttt tggcaccctt atatatgttt tccaaacttt cagcagtgat 180 

attatttcca taacttaaaa agtgagtttg aaaaagaaaa tctccagcaa gcatctcatt 240 

taaataaagg tttgtcatct ttaaaaatac agcaatatgt gactttttaa aaaagctgtc 300 

aaataggtgt gaccctacta ataattatta gaaatacatt taaaaacatc gagtacctca 360 

agtcagtttg ccttgaaaaa tatcaaatat aactcttaga gaaatgtaca taaaagaatg 420 

cttcgtaatt ttggagtang aggttccctc ctcaattttg tatttttaaa aagtacatgg 480 

taaaaaaaaa aattcacaac agtatataag gctgtaaaat gaagaattct gcc 533 



<2 1 0> 7 2 
<211> 511 
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<212> DNA 

<213> Homo sapien 

<220> 

<221> misc__feature 

<222> (1) . . . (511) 
<223> n = A, T,C or G 



<400> 72 

tattacggaa aaacacacca cataattcaa ctancaaaga anactgcttc agggcgtgta 60 

aaatgaaagg cttccaggca gttatctgat taaagaacac taaaagaggg acaaggctaa 120 

aagccgcagg atgtctacac tatancaggc gctatttggg ttggctggag gagctgtgga 180 

aaacatggan agattggtgc tgganatcgc cgtggctatt cctcattgtt attacanagt 240 

gaggttctct gtgtgcccac tggtttgaaa accgttctnc aataatgata gaatagtaca 300 

cacatgagaa ctgaaatggc ccaaacccag aaagaaagcc caactagatc ctcagaanac 360 

gcttctaggg acaataaccg atgaagaaaa gatggcctcc ttgtgccccc gtctgttatg 420 

atttctctcc attgcagcna naaacccgtt cttctaagca aacncaggtg atgatggcna 480 

aaatacaccc cctcttgaag naccnggagg a 513^ 

<210> 73 

<211> 499 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_feature 
<222> (1) . . . (499) 
<223> n = A, T,C or G 



<400> 73 

cagtgccagc actggtgcca gtaccagtac caataacagt gccagtgcca gtgccagcac 60 

cagtggtggc ttcagtgctg gtgccagcct gaccgccact ctcacatttg ggctcttcgc 120 

..ggccttggt ggagctggtg ceagcaccag tggcagctct ggtgcctgtg gtrtctccta 18 0 

caagtgagat tttagatatt gttaatcctg ccagtctttc tcttcaagcc agggtgcatc 240 

ctcagaaacc tactcaacac agcactctag gcagccacta tcaatcaatt gaagttgaca 300 

ctctgcatta aatctatttg ccatttctga aaaaaaaaaa aaaaaaaggg cggccgctcg 360 

antctagagg gcccgtttaa acccgctgat cagcctcgac tgtgccttct anttgccagc 420 

catctgttgt ttgcccctcc cccgntgcct tccttgaccc tggaaagtgc cactcccact 480 

gtcctttcct aantaaaat 49^ 

<210> 74 

<211> 537 

<212> DNA 

<213> Homo sapien 

<220> 

<221> inisc_f eature 
<222> (1) . . . (537) 
<223> n = A,T,C or G 



<400> 74 

tttcatagga gaacacactg aggagatact 
ttatcagctt aactcagata aaatcattga 
tccaggccca cggctcaagt gaatttgaat 
cattgtatgc atggaaacat ggaggaacag 
aaagaattac agactctgat tctacagtga 
ggcttttgat ttataanact ttgggtactt 
cagtttgctt gatatatttg ttgatattaa 



tgaagaattt ggattcagcc gcgaagagat 60 

aagtaataag gtaaaagcta gtctctaact 120 

actgcattta cagtgtagag taacacataa 180 

tattacagtg tcctaccact ctaatcaaga 240 

tgattgaatt ctaaaaatgg taatcattag 300 

atactaaatt atggtagtta tactgccttc 360 

gattcttgac ttatattttg aatgggttct 420 
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cictgaaaaan gaatgatata ttcttqaaga catcgatata catttattta cactcttqat 480 
tctacaatgt agaaaatgaa ggaaatgccc caaattgtat ggtgataaaa gtcccgt 537 

<210> 75 

<211> 467 

<212> DNA 

<213> Homo sapien 

<220> 

<221> mis cofeature 
<222> (1) . . . (467) 
<223> n = A,T,C or G 



<400> 75 

caaanacaat tgttcaaaag atgcaaatga tacactactg ctgcagctca caaacacctc 60 

tgcatattac acgtacctcc tcctgctcct caagtagtgt ggtctatttt gccatcatca 120 

cctgctgtct gcttagaaga acggctttct qctgcaangq agagaaatca taacagacgg 180 

tggcacaagg aggccatctt ttcctcatcg gttattgtcc ctagaagcgt cttctgagga 240 

tctagttggg ctttctttct gggtttgggc catttcantt ctcatgtgtg tactattcta 300 

tcattattgt ataacggttt tcaaaccngt gggcacncag agaacctcac tctgtaataa 360 

caatgaggaa tagccacggt gatctccagc accaaatctc tccatgttnt tccagagctc 420 

ctccagccaa cccaaatagc cgctgctatn gtgtagaaca tccctgn 4 67 



<210> 76 

<211> 400 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (400) 
<223> n - A,T,C or G 



<400> 76 

aagctgacag cattcgggcc gagatgtctc gctccgtggc cttagctgtg ctcgcgctac 60 

tctctctttc tggcctggag gctatccagc gtactccaaa gattcaggtt tactcacgtc 120 

atccagcaga gaatggaaag tcaaatttcc tgaattgcta tgtgtctggg tttcatccat 180 

ccgacattga agttgactta ctgaagaatg gagagagaat tgaaaaagtg gagcattcag 240 

acttgtcttt cagcaaggac tggtctttct atctcttgta ctacactgaa ttcaccccca 300 

ctgaaaaaga tgagtatgcc tgccgtgtga accatgtgac tttgtcacag cccaagatng 360 

ttnagtggga tcganacatg taagcagcan catgggaggt 400 



<210> 77 

<211> 248 

<212> DNA 

<213> Homo sapien 



<400> 77 

ctggagtgcc ttggtgtttc aagcccctgc aggaagcaga atgcaccttc tgaggcacct 60 

ccagctgccc cggcggggga tgcgaggctc ggagcaccct tgcccggctg tgattgctgc 120 

caggcactgt tcatctcagc ttttctgtcc ctttgctccc ggcaagcgct tctgctgaaa 180 

gttcatatct ggagcctgat gtcttaacga ataaaggtcc catgctccac ccgaaaaaaa 240 

aaaaaaaa 248 



<210> 78 

<211> 201 

< 2 1 2 > DNA. 

<213> Homo sapien 
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<400> 78 

actagtccag tgtggtggaa ttccattgtq ttgggcccaa cacaatggct acctttaaca 60 

tcacccagac cccgccctgc ccgtgcccca cgctgctgct aacgacagta tgatgcttac 120 

tctgctactc ggaaactatt tttatgtaat taatgtatgc tttcttgttt ataaatgcct 180 

gatttaaaaa aaaaaaaaaa a 201 

<210> 79 

<211> 552 

<212> DNA 

<213> Homo sapien 



<220> 

<221> nisc_feature 
<22?> ( 1) . . . (552) 
<223> n - A,T,C or G 



<400> 79 

tccttttgtt aggtutttga gacaacccta gacctaaact gtgtcacaga 
tttaggcagt gctdgtaatt tcctcgtaat gattctgtta ttactttcct 
cctctttcct ctqnagatta atgaagttga aaattgaggt ggataaatac 
tgtgatagta taagtatcta agtgcagatg aaagtgtgtt atatatatcc 
atgcaagtta gtaattactc agggttaact aaattacttt aatatgctgt 
ctgttccttg gctagaaaaa attataaaca ggactttgtt agtttgggaa 
taatattcta tgttctaaaa gttgggctat acataaanta tnaagaaata 
ttcccaggaa tatggggttc atttatgaat antacccggg anagaagttt 
cngttttggt taatacgtta atatgtcctn aatnaacaag gcntgactta 
aaaaaaaaaa aa 



cttctgaatg 
attctttatt 
aaaaaggtag 
attcaaaatt 
tgaacctact 
gccaaattga 
tggaatttta 
tgantnaaac 
tttccaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
552 



<210> 80 

<211> 476 

<212> DNA 

<2i3> Homo sapien 



<220> 

<221> misc^feature 
<222> {!)... (476) 
<223> n = A, C or G 



<400> 80 

acagggattt gagatgctaa ggccccagag atcgtttgat ccaaccctct tattttcaga 60 

ggggaaaatg gggcctagaa gttacagagc atctagctgg tgcgctggca cccctggcct 120 

cacacagact cccgagtagc tgggactaca ggcacacagt cactgaagca ggccctgttt 180 

gcaattcacg ttgccacctc caacttaaac attcttcata tgtgatgtcc ttagtcacta 240 

aggttaaact ttcccaccca gaaaaggcaa cttagataaa atcttagagt actttcatac 300 

tcttctaagt cctcttccag cctcactttg agtcctcctt gggggttgat aggaantntc 360 

tcttggcttt ctcaataaaa tctctatcca tctcatgttt aatttggtac gcntaaaaat 420 

gctgaaaaaa ttaaaatgtt ctggtttcnc tttaaaaaaa aaaaaaaaaa aaaaaa 476 

<210> 81 

<211> 232 

<212> DNA 

<213> Homo sapien 



<220> 

<221> inisc_f eature 
<222> {!)... (232) 
<223> n - A, T,C or G 
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<400> 81 

t:LttLttttg tatgccntcn ctgtggngtt attgttgctg ccaccctgga ggagcccagt 60 

ttcttctgta tctttctttt ctgggggatc ttcctggctc tgcccctcca ttcccagcct 120 

ctcatcccca tcttgcactt ttgctagggt tggaggcgct ttcctggtag cccctcagag 180 

acrcagtcag cgggaataag tcctaggggt ggggggtgtg gcaagccggc ct 232 



<21C> 82 

<211> 383 

<212> DNA 

<213> Homo sapien 

<220> 

<221> inisc_f eature 
<222> {!)... (383) 
<223> n - A, T,C or G 



<400> 82 

aggcgggagc agaagctaaa gccaaagccc aagaagagtg gcagtgccag cactggtgcc 60 

agtaccagta ccaataacat gccagtgcca gtgccagcac cagtggtggc ttcagtgctg 120 

gtgccagcct gaccgccact ctcacatttg ggctcttcgc tggccttggt ggagctggtg 180 

ccaqcaccag tggcagctct ggtgcctgtq gtttctccta caagtgagat tttagatatt 240 

gttaatcctg ccagtctttc tcttcaagcc agggtgcatc ctcagaaacc tactcaacac 300 

agcactctng gcagccacta tcaatcaatt gaagttgaca ctctgcatta aatctatttg 360 

ccatttcaaa aaaaaaaaaa aaa 383 



<210> 83 

<211> 494 

<212> DNA 

<213> Homo sapien 

<220> 

<221> mis cofeature 
<222> (1) . , . (494 ) 
<223> n = A, T, C or G 



<400> 83 

cicogaattgg gaccgctggc ttataagcga tcatgtcctc cagtattacc tcaacgagca 60 

gggagatcga gtctatacgc tgaagaaatt tgacccgatg ggacaacaga cctgctcagc 120 

ccatcctgct cggttctccc cagatgacaa atactctcga caccgaatca ccatcaagaa 180 

acgcttcaag gtgctcatga cccagcaacc gcgccctgtc ctctgagggt ccttaaactg 240 

atgtcttttc tgccacctgt tacccctcgg agactccgta accaaactct tcggactgtg 300 

agccctgatg cctttttgcc agccatactc tttggcntcc agtctctcgt ggcgattgat 360 

tatgcttgtg tgaggcaatc atggtggcat cacccatnaa gggaacacat ttganttttt 420 

tttcncatat tttaaattac naccagaata nttcagaata aatgaattga aaaactctta 480 

aaaaaaaaaa aaaa 494 



<210> 84 

<211> 380 

<212> DNA 

<213> Homo sapien 

<220> 

<221> rnisc_f eature 
<222> (1) . . . (380) 
<223> n = A, T, C or G 



<400> 84 
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gctggtagcc tatggcgtgg ccacggangg gctcctgagg cacgggacag tgacttccca 60 

agtatcctgc gccgcgtctt ctaccgtccc tacctgcaga tcttcgggca gattccccag 120 

gaggacatgg acgtggccct catggagcac agcaactgct cgtcggagcc cggcttctgg 180 

gcacaccctc ctggggccca ggcgggcacc tgcgtctccc agtatgccaa ctggctggtg 240 

gtgctgctcc tcgtcatctt cctgctcgtg gccaacatcc tgctggtcac ttgctcattg 300 

ccatgttcag ttacacattc ggcaaagtac agggcaacag cnatctctac tgggaaggcc 360 

agcgttnccg cctcatccgg ^BO 

<210> 85 

<211> 481 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc__f eature 
<222> (1) . . . (481) 
<223> n = A, T,C or G 

<400> 85 

gagttagctc ctccacaacc ttgatgaggt cgtctgcagt ggcctctcgc ttcataccgc 60 

tnccatcgtc atactgtagg tttgccacca cctcctgcat cttggggcgg ctaatatcca 120 

ggaaactctc aatcaagtca ccgtcnatna aacctgtggc tggttctgtc ttccgctcgg 180 

tgtgaaagga tctccagaag gagtgctcga tcttccccac acttttgatg actttattga 240 

gtcgattctg catgtccagc aggaggttgt accagctctc tgacagtgag gtcaccagcc 300 

ctatcatgcc nttgaacgtg ccgaagaaca ccgagccttg tgtggggggt gnagtctcac 360 

ccagattctg cattaccaga nagccgtggc aaaaganatt gacaactcgc ccaggnngaa 420 

aaagaacacc tcctggaagt gctngccgct cctcgtccnt tggtggnngc gcntnccttt 4 80 

^ 481 

<210> 86 
<211> 472 
<212> DNA 

-•^-1- — '- i^v^'iLti— o.ap-L.c;jii 

<220> 

<221> misc_f eature 
<222> (1) . . . (472) 
<223> n = A, T,C or G 

<400> 86 

aacatcttcc tgtataatgc tgtgtaatat cgatccgatn ttgtctgctg agaattcatt 60 

acttggaaaa gcaacttnaa gcctggacac tggtattaaa attcacaata tgcaacactt 120 

taaacagtgt gtcaatctgc tcccttactt tgtcatcacc agtctgggaa taagggtatg 180 

ccctattcac acctgttaaa agggcgctaa gcatttttga ttcaacatct ttttttttga 240 

cacaagtccg aaaaaagcaa aagtaaacag ttnttaattt gttagccaat tcactttctt 300 

catgggacag agccatttga tttaaaaagc aaattgcata atattgagct ttgggagctg 360 

atatntgagc ggaagantag cctttctact tcaccagaca caactccttt catattggga 420 

tgttnacnaa agttatgtct cttacagatg ggatgctttt gtggcaattc tg 472 

<210> 87 

<211> 413 

<212> DNA 

<213> Homo sapien 

<220> 

<221> inxsc_feature 
<222> (1) . . , (413) 
<223> n - A,T,C or G 
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<400> 87 

-oaaaccagt atctctnaaa acaacctctc ataccttgtg aaccraattt tgtgtgcgtg 60 

rqtgLgtgcg cgcatattat atagacaggc acatcttttt tacttttgta aaagcttatg 120 

cctctttggt atctatatct gtgaaagttt taatgatctg ccataatgtc ttggggacct 180 

itgLcttctg tgtaaatggt actagagaaa acacctatnt tatgagtcaa tctagttngt 240 

^ t-tat- i"cgac atgaaggaaa tttccagatn acaacactna caaactctcc cttgactagg 300 

Ljoggacaaag aaaagcanaa ctgaacatna gaaacaattn cctggtgaga aattncataa 360 

acagaaattg ggtngtatat tgaaananng catcattnaa acgttttttt ttt 413 



<210> 88 

<211> 448 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (448) 
<223> n = A,T,C or G 



<400> 88 

cgcagcgggt cctctctatc tagctccagc ctctcgcctg ccccactccc cgcgtcccgc 60 

gtcctagccn accatggccg ggcccctgcg cgccccgctg ctcctgctgg ccatcctggc 120 

cgtggccctg gccgtgagcc ccgcggccgg ctccagtccc ggcaagccgc cgcgcctggt 180 

gggaggccca tggaccccgc gtggaagaag aaggtgtgcg gcgtgcactg gactttgccg 240 

tcggcnanta caacaaaccc gcaacnactt ttaccnagcn cgcgctgcag gttgtgccgc 300 

Gccaancaaa ttgttactng gggtaantaa ttcttggaag ttgaacctgg gccaaacnng 360 

tttaccagaa ccnagccaat tngaacaatt ncccctccat aacagcccct tttaaaaagg 420 

gaancantcc tgntcttttc caaatttt 448 



<210> 89 

<211> 463 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (463) 
<223> n = A, C or G 



<400> 89 

gaattttgtg cactggccac tgtgatggaa ccattgggcc aggatgcttt gagtttatca 60 

gtagtgattc tgccaaagtt ggtgttgtaa catgagtatg taaaatgtca aaaaattagc 120 

aqaggtctag gtctgcatat cagcagacag tttgtccgtg tattttgtag ccttgaagtt 180 

ctcagtgaca agttnnttct gatgcgaagt tctnattcca gtgttttagt cctttgcatc 240 

tttnatgttn agacttgcct ctntnaaatt gcttttgtnt tctgcaggta ctatctgtgg 300 

Ittaacaaaa tagaannact tctctgcttn gaanatttga atatcttaca tctnaaaatn 360 

aattctctcc ccatannaaa acccangccc ttggganaat ttgaaaaang gntccttcnn 420 

aattcnnana anttcagntn tcatacaaca naacngganc ccc 463 



<210> 90 

<211> 400 

<212> DNA 

<213> Homo sapien 

<220> 

<r221> misc feature 
<222> (1) .7. (400) 
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<223> n - A,T,C or G 



<400> 90 

agggattgaa ggtctnttnt actgtcggac tgttcancca ccaactctac aagttgctgt 60 

cttccactca ctgtctgtaa gcntnttaac ccagactgta tcttcataaa tagaacaaat 120 

tcttcaccag tcacatcttc taggaccttt ttggattcag ttagtataag ctcttccact 180 

tcctttgtta agacttcatc tggtaaagtc ttaagttttg tagaaaggaa tttaattgct 240 

cgttctctaa caatgtcctc tccttgaagt atttggctga acaacccacc tnaagtccct 300 

ttgtgcatcc attttaaata tacttaatag ggcattggtn cactaggtta aattctgcaa 360 

gagtcatctg tctgcaaaag ttgcgttagt atatctgcca 400 



<210> 91 

<211> 480 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> {!)... (480) 
<223> n = A, T,C or G 



<400> 91 

gagctcggat ccaataatct ttgtctgagg gcagcacaca tatncagtgc catggnaact 60 

ggtctacccc acatgggagc agcatgccgt agntatataa ggtcattccc tgagtcagac 120 

atgcctcttt gactaccgtg tgccagtgct ggtgattctc acacacctcc nnccgctctt 180 

tgtggaaaaa ctggcacttg nctggaacta gcaagacatc acttacaaat tcacccacga 240 

gacacttgaa aggtgtaaca aagcgactct tgcattgctt tttgtccctc cggcaccagt 300 

tgtcaatact aacccgctgg tttgcctcca tcacatttgt gatctgtagc tctggataca 360 

tctcctgaca gtactgaaga acttcttctt ttgtttcaaa agcaactctt ggtgcctgtt 420 

ngatcaggtt cccatttccc agtccgaatg ttcacatggc atatnttact tcccacaaaa 480 



<210> 92 

<21 1> 477 

<212> DNA 

<213> Homo sapien 

<220> 

<221> mis cofeature 
<222> (1) . . . (477) 
<223> n = A, T,C or G 



<400> 92 

atacagccca natcccacca cgaagatgcg cttgttgact gagaacctga tgcggtcact 60 

ggtcccgctg tagccccagc gactctccac ctgctggaag cggttgatgc tgcactcctt 120 

cccacgcagg cagcagcggg gccggtcaat gaactccact cgtggcttgg ggttgacggt 180 

taantgcagg aagaggctga ccacctcgcg gtccaccagg atgcccgact gtgcgggacc 24 0 

tgcagcgaaa ctcctcgatg gtcatgagcg ggaagcgaat gangcccagg gccttgccca 300 

gaaccttccg cctgttctct ggcgtcacct gcagctgctg ccgctnacac tcggcctcgg 360 

accagcggac aaacggcgtt gaacagccgc acctcacgga tgcccantgt gtcgcgctcc 420 

aggaacggcn ccagcgtgtc caggtcaatg tcggtgaanc ctccgcgggt aatggcg 4 77 



<210> 93 

<211> 377 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc feature 
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<222> (1) . . . (377) 
<22 3> n - A, T, C or G 



<400> 93 

qaacggctgg accttgcctc gcattgtgct gctggcagga ataccttggc aagcagctcc 60 

agtccgagca gccccagacc gctgccgccc gaagctaagc ctgcctctgg ccttcccctc 120 

cgc""tcaatg cagaaccant agtgggagca ctgtgtttag agttaagagt gaacactgtn 180 

Lgactttact tgggaatttc ctctgttata tagcttttcc caatgctaat ttccaaacaa 240 

::aacaacaaa ataacatgtt tgcctgttna gttgtataaa agtangtgat tctgtatnta 300 

aaqaaaatat tactgttaca tatactgctt gcaanttctg tatttattgg tnctctggaa 360 

-t.:^iatatat tattaaa 377 



<210> 94 

<211> 495 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (495) 
<223> n = A, T, C or G 



<400> 94 

^'^cctttgagg ggttagggtc cagttcccag tggaagaaac aggccaggag aantgcgtgc 60 

cgagctgang cagatttccc acagtgaccc cagagccctg ggctatagtc tctgacccct 120 

ccaaggaaag accaccttct ggggacatgg gctggagggc aggacctaga ggcaccaagg 180 

gaaggcccca ttccggggct gttccccgag gaggaaggga aggggctctg tgtgcccccc 240 

acgaggaana ggccctgant cctgggatca nacacccctt cacgtgtatc cccacacaaa 300 

tgcaagctca ccaaggtccc ctctcagtcc cttccctaca ccctgaacgg ncactggccc 360 

acacccaccc agancancca cccgccatgg ggaatgtnct caaggaatcg cngggcaacg 420 

tgqactctng tcccnnaagg gggcagaatc tccaatagan gganngaacc cttgctnana 4 80 

aaaaaaaana aaaaa 4 95 



<210> 95 

<211> 472 

<212> DNA 

<213> Homo sapien 

<220> 

<221> niisc_f eature 
<222> (1) . . . (472) 
<223> n = A, T, C or G 



<400> 95 

cujl.i acttgg tttcattgcc accacttagt ggatgtcatt tagaaccatt ttgtctgctc 60 

cctctggaag ccttgcgcag agcggacttt gtaattgttg gagaataact gctgaatttt 120 

^lagctgtttt gagttgattc gcaccactgc accacaactc aatatgaaaa ctatttnact 180 

tatttattat cttgtgaaaa gtatacaatg aaaattttgt tcatactgta tttatcaagt 240 

atgatgaaaa gcaatagata tatattcttt tattatgttn aattatgatt gccattatta 300 

atcggcaaaa tgtggagtgt atgttctttt cacagtaata tatgcctttt gtaacttcac 360 

Itggttattt tattgtaaat gaattacaaa attcttaatt taagaaaatg gtangttata 420 

tttanttcan taatttcttt ccttgtttac gttaattttg aaaagaatgc at 472 



<21C> 96 

<211> 476 

<212> DNA 

<213> Homo sapien 
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<220> 

<221> misc^f eature 

<222> (1) . . . (476) 

<223> n = A,T,C or G 



<400> 96 

ctgaagcatt tcttcaaact tntctacttt tgtcattgat acctgtagta agttgacaat 60 

gtggtgaaat ttcaaaatta tatgtaactt ctactagttt tactttctcc cccaagtctt 120 

ttttaactca tgatttttac acacacaatc cagaacttat tatatagcct ctaagtcttt 180 

attcttcaca gtagatgatg aaagagtcct ccagtgtctt gngcanaatg ttctagntat 240 

agctggatac atacngtggg agttctataa actcatacct cagtgggact naaccaaaat 300 

tgtgttagtc tcaattccta ccacactgag ggagcctccc aaatcactat attcttatct 360 

gcaggtactc ctccagaaaa acngacaggg caggcttgca tgaaaaagtn acatctgcgt 420 

tacaaagtct atcttcctca nangtctgtn aaggaacaat ttaatcttct agcttt 476 



<210> 97 

<211> 479 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (479) 
<223> n = A,T,C or G 

<400> 97 

actctttcta atgctgatat gatcttgagt ataagaatgc atatgtcact agaatggata 60 

aaataatgct gcaaacttaa tgttcttatg caaaatggaa cgctaatgaa acacagctta 120 

caatcgcaaa tcaaaactca caagtgctca tctgttgtag atttagtgta ataagactta 180 

gattgtgctc cttcggatat gattgtttct canatcttgg gcaatnttcc ttagtcaaat 240 

caggctacta gaattctgtt attggatatn tgagagcatg aaatttttaa naatacactt 300 

gtgattatna aattaatcac aaatttcact tatacctgct atcagcagct agaaaaacat 360 

iitrinttttta natcaaagta ttrtg-cgtrr ggaantgtnn aaatgaaatc tgaatgtggg 420 

ttcnatctta ttttttcccn gacnactant tnctttttta gggnctattc tganccatc 47 9 

<210> 98 

<211> 461 

<212> DNA 

<213> Homo sapien 

<400> 98 

agtgacttgt cctccaacaa aaccccttga tcaagtttgt ggcactgaca atcagaccta 60 

tgctagttcc tgtcatctat tcgctactaa atgcagactg gaggggacca aaaaggggca 120 

tcaactccag ctggattatt ttggagcctg caaatctatt cctacttgta cggactttga 180 

agtgattcag tttcctctac ggatgagaga ctggctcaag aatatcctca tgcagcttta 240 

tgaagccact ctgaacacgc tggttatcta gatgagaaca gagaaataaa gtcagaaaat 300 

ttacctggag aaaagaggct ttggctgggg accatcccat tgaaccttct cttaaggact 360 

ttaagaaaaa ctaccacatg ttgtgtatcc tggtgccggc cgtttatgaa ctgaccaccc 420 

tttggaataa tcttgacgct cctgaacttg ctcctctgcg a 4 61 

<210> 99 

<211> 171 

<212> DNA 

<213> Homo sapien 

<400> 99 

gtggccgcgc gcaggtgttt cctcgtaccg cagggccccc tcccttcccc aggcgtccct 60 
cggcgcctct gcgggcccga ggaggagcgg ctggcgggtg gggggagtgt gacccaccct 120 
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roai^qaqaaa agccttctct agcgatctqa gaggcgtgcc ttgggggtac c 171 

<210> 100 
<211> 269 
<212> DNA 
<213> Homo sapien 



<400> 100 

cgqccgcaag tgcaactcca gctggggccg tgcggacgaa gattctgcca gcagttggtc 60 

cg^ictgcgac gacggcggcg gcgacagtcg caggtgcagc gcgggcgcct ggggtcttgc 120 

aaagotgagc tgacgccgca gaggtcgtgt cacgtcccac gaccttgacg ccgtcgggga 180 

cagccggaac agagcccggt gaagcgggag gcctcgggga gcccctcggg aagggcggcc 240 

cgaqagatac gcaggtgcag gtggccgcc 269 



<210> 101 
<211> 405 
<212> DNA 
<213> Homo sapien 

<400> 101 

tttttttttt ttttggaatc tactgcgagc acagcaggtc agcaacaagt ttattttgca 60 

gctagcaagg taacagggta gggcatggtt acatgttcag gtcaacttcc tttgtcgtgg 120 

tt.gattggtt tgtctttatg ggggcggggt ggggtagggg aaacgaagca aataacatgg 180 

agtgggtgca ccctccctgt agaacctggt tacaaagctt ggggcagttc acctggtctg 240 

tgaccgtcat tttcttgaca tcaatgttat tagaagtcag gatatctttt agagagtcca 300 

ctgttctgga gggagattag ggtttcttgc caaatccaac aaaatccact gaaaaagttg 360 

gatgatcagt acgaataccg aggcatattc tcatatcggt ggcca 405 

<210> 102 

<211> 470 

<212> DNA 

<213> Homo sapien 

<400> 102 

tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt 60 

ggcacttaat ccatttttat ttcaaaatgt ctacaaattt aatcccatta tacggtattt 120 

Lcaaaatcta aattattcaa attagccaaa tccttaccaa ataataccca aaaatcaaaa 180 

atatacttct ttcagcaaac ttgttacata aattaaaaaa atatatacgg ctggtgtttt 240 

caaagtacaa ttatcttaac actgcaaaca ttttaaggaa ctaaaataaa aaaaaacact 300 

ccgcaaaggt taaagggaac aacaaattct tttacaacac cattataaaa atcatatctc 360 

aaatcttagg ggaatatata cttcacacgg gatcttaact tttactcact ttgtttattt 420 

ttttaaacca ttgtttgggc ccaacacaat ggaatccccc ctggactagt 470 

<210> 103 
<211> 581 
<212> DNA 
<213> Homo sapien 



<400> 103 

tttttttttt ttttttttga cccccctctt ataaaaaaca agttaccatt ttattttact 60 

lacacatatt tattttataa ttggtattag atattcaaaa ggcagctttt aaaatcaaaa 120 

taaatggaaa ctgccttaga tacataattc ttaggaatta gcttaaaatc tgcctaaagt 180 

gaaaatcttc tctagctctt ttgactgtaa atttttgact cttgtaaaac atccaaattc 240 

atttttcttg tctttaaaat tatctaatct ttccattttt tccctattcc aagtcaattt 300 

gcttctctag cctcatttcc tagctcttat ctactattag taagtggctt ttttcctaaa 360 

agggaaaaca ggaagagaaa tggcacacaa aacaaacatt ttatattcat atttctacct 420 

acgttaataa aatagcattt tgtgaagcca gctcaaaaga aggcttagat ccttttatgt 480 

ccattttagt cactaaacga tatcaaagtg ccagaatgca aaaggtttgt gaacatttat 540 
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tcaaaagcta atataagata tttcacatac tcatctttct 



581 



<210> 104 
<211> 578 
<2i2> DNA 
<213> Homo sapien 



<400 
tttttttttt 
cactctctag 
ctcttatgct 
aggaaatctg 
gaggtttttc 
ttcatgcaaa 
caaaactgct 
aaatcacatt 
aaaggaacat 
tgaattcaca 



> 104 
tttttttttt 
atagggcatg 
atatcatatt 
ttcattcttc 
ttctctattt 
ctagaaaata 
caaattgttt 
tacgacagca 
ttttagcctg 
tgttattatt 



tttttctctt 
aagaaaactc 
ttaagttaaa 
tcattcatat 
acacatatat 
atgtttcttt 
gttaagttat 
ataataaaac 
ggtataatta 
cctagcccaa 



cttttttttt 
atctttccag 
ctaatgagtc 
agttatatca 
ttccatgtga 
tgcataagag 
ccattataat 
tgaagtacca 
gctaattcac 
cacaatgg 



gaaatgagga 
ctttaaaata 
actggcttat 
agtactacct 
atttgtatca 
aagagaacaa 
tagttggcag 
gttaaatatc 
tttacaagca 



tcgagttttt 
acaatcaaat 
cttctcctga 
tgcatattga 
aacctttatt 
tatagcatta 
gagctaatac 
caaaataatt 
tttattagaa 



<210> 105 
<211> 538 
<212> DNA 
<213> Homo sapien 

<400> 105 
tttttttttt tttttcagta ataatcagaa 
gaaaagtgcc ttacatttaa taaaagtttg 
gtcttgaaca ccaatattaa tttgaggaaa 
aagatcatag agcttgtaag tgaaaagata 
aaatccacta ttagcaaata aattactatg 
ggggtgtcac tggtaaacca acacattctg 
tgtactttgc taatacgtgg atatgagttg 
ggcgagaaat gaggaagaaa agaaaaggat 
agatatgttt cctttgccaa tattaaaaaa 

<210> 106 

<211> 473 

<212> DNA 

<213> Homo sapien 

<400> 106 
tttttttttt ttttttagtc aagtttctat 
atttattagc tctgcaactt acatatttaa 
tttataaatg taaggtgcca ttattgagta 
tctcccacca actaatgaac agcaacatta 
gcaaacgcta attctcttct ccatccccat 
aatgcatcac aatctacaat caacagcaag 
agactgtgtc tgtctgaatc aaatgatctg 
ccgcttcctc aaaggcgctg ccacatttgt 



caatatttat 
tttctcaaag 
atacaccaaa 
aaatttgacc 
gacttcttgc 
aaggatacat 
acaagtttct 
Lacgcatact 
ataataatgt 



ttttatattt 
tgatcagagg 
atacattaag 
tcagaaactc 
tttaattttg 
tacttagtga 
ctttcttcaa 
gttctttcta 
ttactactag 



aaaattcata 
aattagatat 
taaattattt 
tgagcattaa 
tgatgaatat 
tagattctta 
tcttttaagg 
tggaaggatt 
tgaaaccc 



ttttattata 
attaaagaaa 
atatattcct 
gtttaatttt 
gtgatattgt 
atgaagctag 
acctatcctc 
ggctctttgc 



attaaagtct 
cgttttagac 
ccaagagtgg 
attagtagat 
gtatatgtgt 
gctgggcttt 
ggtggcaaga 
acttgtttca 



tggtcatttc 
aactgtacaa 
atgtgtccct 
atacactgct 
gagttggtag 
cggtgaaaat 
actcttcgaa 
aaa 



<210> 107 

<211> 1621 

<212> DNA 

<213> Homo sapien 

<400> 107 

cgccatggca ctgcagggca tctcggtcat ggagctgtcc ggcctggccc cgggcccgtt 
ctgtgctatg gtcctggctg acttcggggc gcgtgtggta cgcgtggacc ggcccggctc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
578 



60 
120 
180 
240 
300 
360 
420 
460 
538 



60 
120 
180 
240 
300 
360 
420 
473 



60 
120 
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rcgcracgac atgagccgct tgggccgggq caagcgctcg ctagtgctgg acctgaagca 180 

accgcgggga gccgccgtgc rgcggcgtct gugcaagcgg tcggatgtgc tgctggagcc 240 

rt.tr:cgccqc qgrgtcatgg agaaactcca gctgggccca gagattctgc agcgggaaaa 300 

rccaaggctt atttatgcca ggctgagtgg atttggccag tcaggaagct tctgccggtt 360 

agciggccac aatatcaact atttggcttt gtcaggtgtt ctctcaaaaa ttggcagaag 420 

t^gqtgagaat ccgtatgccc cgctgaatct cctggctgac tttgctggtg gtggccttat 480 

grgtgcactg ggcattataa tggctctttt tgaccgcaca cgcactgaca agggtcaggt 540 

^attgatgca aatatggtgg aaggaacagc atatttaagt tcttttctgt ggaaaactca 600 

gaaatcgaqt ctgtgggaag cacctcgagg acagaacatg ttggatggtg gagcaccttt 660 

ctatacgact tacaggacag cagatgggga attcatggct gttggagcaa tagaacccca 720 

gr'ctacgag ctqctgatca aaggacttgg actaaagtct gatgaacttc ccaatcagat 780 

gagcatggat gattggccag aaatgaagaa gaagtttgca gatgtatttg caaagaagac 840 

gaaggcaqag tggtgrcaaa rctttgacgg cacagatgcc tgtgtgactc cggttctgac 900 

ttttgaggaq ottgttcatc atgatcacaa caaggaacgg ggctcgttta tcaccagtga 960 

ggagcaggac atgagccccc gccctgcacc tctgctgtta aacaccccag ccatcccttc 1020 

tttcaaaagi gatcctttca taggagaaca cactgaggag atacttgaag aatttggatt 1080 

cagccgcgaa qaqnlttatc agcttaactc agataaaatc attgaaagta ataaggtaaa 1140 

agctagtctc taacitccaq gcccacggct caagtgaatt tgaatactgc atttacagtg 1200 

tagagtaaca cataacatlg tatgcatgga aacatggagg aacagtatta cagtgtccta 1260 

ccactctaat ca<igaaaaga attacagact ctgattctac agtgatgatt gaattctaaa 1320 

aatggttatc attagggctt ttgatttata aaactttggg tacttatact aaattatggt 1380 

agttattctq ccttccaatt tqcttgatat atttgttgat attaagattc ttgacttata 1440 

ttttgaatgg attctagtga aaaaggaatg atatattctt gaagacatcg atatacattt 1500 

atttacactc ttgattctac aatgtagaaa atgaggaaat gccacaaatt gtatggtgat 1560 

aaaagtcacq t.gaaacaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1620 

a 1621 



<21C> 106 

<211> 382 

<212> PHT 

<213> Hoxo sapien 

<400> 108 
M(j1 Ala Leu Gin Gly lie Ser 

1 5 
Gly Pro Phe Cys Ala Met Val 
20 

Arg Val Asp Arg Pro Gly Ser 
35 

Gly Lys Arg Ser Leu Val Leu 

50 55 
Val Leu Arg Arg Leu Cys Lys 
65 70 
Arg Arg Gly Val Met Glu Lys 
85 

Arg Glu Asn Pro Arg Leu He 
100 

Ser Gly Ser Phe Cys Arg Leu 
115 

Leu Ser Gly Val Leu Ser Lys 
130 135 
Ala Pro Leu Asn Leu Leu Ala 

145 150 
Ala Leu Gly He He Met Ala 
165 

Gly Gin Val He Asp Ala Asn 
180 

Ser Phe Leu Trp Lys Thr Gin 



Val Met Glu Leu Ser Gly Leu Ala Pro 

10 15 
Leu Ala Asp Phe Gly Ala Arg Val Val 

25 30 
Arg Tyr Asp Val Ser Arg Leu Gly Arg 
40 45 
Asp Leu Lys Gin Pro Arg Gly Ala Ala 
60 

Arg Ser Asp Val Leu Leu Glu Pro Phe 

75 80 
Leu Gin Leu Gly Pro Glu He Leu Gin 

90 95 
Tyr Ala Arg Leu Ser Gly Phe Gly Gin 

105 110 
Ala Gly His Asp He Asn Tyr Leu Ala 
120 125 
He Gly Arg Ser Gly Glu Asn Pro Tyr 
140 

Asp Phe Ala Gly Gly Gly Leu Met Cys 
155 160 
Leu Phe Asp Arg Thr Arg Thr Asp Lys 

170 175 
Met Val Glu Gly Thr Ala Tyr Leu Ser 

185 190 
Lys Ser Ser Leu Trp Glu Ala Pro Arg 
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195 200 205 



Gly 


Gin 


Asn 


Met 


Leu 


Asp 


Gly 


Glv 

W J. 


Ala 


Pro 


Phe 


Tvr 


Thr 


Thr 


Tvr 


Arg 




210 










215 










220 








Thr 


Ala 


Asp 


Gly 


Glu 


Phe 


Met 


Ala 


Val 


Gly Ala 


He 


Glu 


Pro 


Gin 


Phe 


225 










230 










235 










240 


Tyr 


Glu 


Leu 


Leu 


He 
245 


Lys 


Gly 


Leu 


Gly 


Leu 
250 


Lys 


Ser 


Asp 


Glu 


Leu 

255 


Pro 


Asn 


Gin 


Met 


Ser 
260 


Met 


Asp 


Asp 


Trp 


Pro 
265 


Glu 


Met 


Lys 


Lys 


Lys 
270 


Phe 


Ala 


Asp 


Val 


Phe 


Ala 


Lys 


Lys 


Thr 


Lvs 


Ala 


Glu 


Trp 


Cys 


Gin 


He 


Phe 


Asp 






275 










280 










285 






Gly 


Thr 

290 


Asp 


Ala 


Cys 


Val 


Thr 
295 


Pro 


Val 


Leu 


Thr 


Phe 
300 


Glu 


Glu 


Val 


Val 


His 


His 


Asp 


His 


Asn 


Lys 


Glu 


Arg 


Gly 


Ser 


Phe 


He 


Thr 


Oc; J. 


mil 


*jX Li 


305 










310 










315 










320 


Gin 


Asp 


Val 


Ser 


Pro 
325 


Arg 


Pro 


Ala 


Pro 


Leu 
330 


Leu 


Leu 


Asn 


Thr 


Pro 
335 


Ala 


lie 


Pro 


Ser 


Phe 


Lys 


Arg 


Asp 


Pro 


Phe 


He 


Gly Glu 


His 


Thr 


Glu 


Glu 








340 










345 










350 






He 


Leu 


Glu 
355 


Glu 


Phe 


Gly 


Phe 


Ser 
360 


Arg 


Glu 


Glu 


He 


Tyr 
365 


Gin 


Leu 


Asn 


Ser 


Asp 
370 


Lys 


He 


He 


Glu 


Ser 

375 


Asn 


Lys 


Val 


Lys 


Ala 
380 


Ser 


Leu 







<210> 109 

<211> 1524 

<212> DNA 

<213> Homo sapien 

<400> 109 

ggcacgaggc tgcgccaggg cctgagcgga ggcgggggca gcctcgccag cgggggcccc 60 

gggcctggcc atgcctcact gagccagcgc ctgcgcctct acctcgccga cagctggaac 120 

cagtgcgacc tagtggctct cacctgcttc ctcctgggcg tgggctgccg gctgaccccg 160 

ggtttgtacc acctgggccg cactgtcctc tgcatcgact tcatggtttt cacggtgcgg 240 

ctgcttcaca tcttcacggt caacaaacag ctggggccca agatcgtcat cgtgagcaag 300 

atgatgaagg acgtgttctt cttcctcttc ttcctcggcg tgtggctggt agcctatggc 360 

gtggccacgg aggggctcct gaggccacgg gacagtgact tcccaagtat cctgcgccgc 420 

gtcttctacc gtccctacct gcagatcttc gggcagattc cccaggagga catggacgtg 480 

gccctcatgg agcacagcaa ctgctcgtcg gagcccggct tctgggcaca ccctcctggg 540 

gcccaggcgg gcacctgcgt ctcccagtat gccaactggc tggtggtgct gctcctcgtc 600 

atcttcctgc tcgtggccaa catcctgctg gtcaacttgc tcattgccat gttcagttac 660 

acattcggca aagtacaggg caacagcgat ctctactgga aggcgcagcg ttaccgcctc 720 

atccgggaat tccactctcg gcccgcgctg gccccgccct ttatcgtcat ctcccacttg 780 

cgcctcctgc tcaggcaatt gtgcaggcga ccccggagcc cccagccgtc ctccccggcc 840 

ctcgagcatt tccgggttta cctttctaag gaagccgagc ggaagctgct aacgtgggaa 900 

tcggtgcata aggagaactt tctgctggca cgcgctaggg acaagcggga gagcgactcc 960 

gagcgtctga agcgcacgtc ccagaaggtg gacttggcac tgaaacagct gggacacatc 1020 

cgcgagtacg aacagcgcct gaaagtgctg gagcgggagg tccagcagtg tagccgcgtc 108 0 

ctggggtggg tggccgaggc cctgagccgc tctgccttgc tgcccccagg tgggccgcca 114 0 

ccccctgacc tgcctgggtc caaagactga gccctgctgg cggacttcaa ggagaagccc 1200 

ccacagggga ttttgctcct agagtaaggc tcatctgggc ctcggccccc gcacctggtg 1260 

gccttgtcct tgaggtgagc cccatgtcca tctgggccac tgtcaggacc acctttggga 1320 

gtgtcatcct tacaaaccac agcatgcccg gctcctccca gaaccagtcc cagcctggga 1380 

ggatcaaggc ctggatcccg ggccgttatc catctggagg ctgcagggtc cttggggtaa 144 0 

cagggaccac agacccctca ccactcacag attcctcaca ctggggaaat aaagccattt 1500 

cagaggaaaa aaaaaaaaaa aaaa 1524 

<210> 110 
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<211> 3410 

<212> DNA 

<213> Homo sapien 



<400> 110 

gggaaccagc ctgcacgcgc tggctccggg tgacagccgc gcgcctcggc caggatctga 60 

gtgatgagac gtqtccccac tqaggtgccc cacagcagca ggtgttgagc argggctgag 120 

aagctggacc ggcaccaaag ggctggcaga aatgggcgcc tggctgattc ctaggcagtt 180 

ggcggcagca aggaggagag gccgcagctt ctggagcaga gccgagacga agcagttctg 240 

gagtgcctga acggccccct gagccctacc cgcctggccc actatggtcc agaggctgtg 300 

qgtgagccgc ctgctgcggc accggaaagc ccagctcttg ctggtcaacc tgctaacctt 360 

tggcctggag gtgtgtttgg ccgcaggcat cacctatgtg ccgcctctgc tgctggaagt 420 

gggggtagag gagaagttca tgaccatggt gctgggcatt ggtccagtgc tgggcctggt 480 

ctgtgtcccg ctcctaggct cagccagtga ccactggcgt ggacgctatg gccgccgccg 540 

gcccttcatc tgggcactgt ccttgggcat cctgctgagc ctctttctca tcccaagggc 600 

cggctggcta gcagggctgc tgtgcccgga tcccaggccc ctggagctgg cactgctcat 660 

cctgggcgtg gggctgctgg acttctgtgg ccaggtgtgc ttcactccac tggaggccct 720 

gctctctgac ctcttccggg acccggacca ctgtcgccag gcctactctg tctatgcctt 780 

catgatcagt cttgggggct gcctgggcta cctcctgcct gccattgact gggacaccag 840 

tqccctggcc ccctacctgg gcacccagga ggagtgcctc tttggcctgc tcaccctcat 900 

cttcctcacc tgcgtagcag ccacactgct ggtggctgag gaggcagcgc tgggccccac 960 

cqagccagca gaagggctgt cggccccctc cttgtcgccc cactgctgtc catgccgggc 1020 

ccgcttggct ttccggaacc tgggcgccct gcttccccgg ctgcaccagc tgtgctgccg 1080 

r.dtgccccgc accctgcgcc ggctcttcgt ggctgagctg tgcagctgga tggcactcat 1140 

gaccttcacg ctgttttaca cggatttcgt gggcgagggg ctgtaccagg gcgtgcccag 1200 

agctgagccg ggcaccgagg cccggagaca ctatgatgaa ggcgttcgga tgggcagcct 1260 

ggggctgttc ctgcagtgcg ccatctccct ggtcttctct ctggtcatgg accggctggt 1320 

gcagcgattc ggcactcgag cagtctattt ggccagtgtg gcagctttcc ctgtggctgc 1380 

cggtgccaca tgcctgtccc acagtgtggc cgtggtgaca gcttcagccg ccctcaccgg 1440 

gttcaccttc tcagccctgc agatcctgcc ctacacactg gcctccctct accaccggga 1500 

gaagcaggtg ttcctgccca aataccgagg ggacactgga ggtgctagca gtgaggacag 1560 

cctgatgacc agcttcctgc caggccctaa gcctggagct cccttcccta atggacacgt 1620 

cfggtgctgga ggcagtggcc tgctcccacc tccacccgcg ctctgcgggg cctctgcctg 1680 

Lgatgtctcc gtacgtgtgg tggtgggtga gcccaccgag gccagggtgg ttccgggccg 1740 

gggcatctgc ctggacctcg ccatcctgga tagtgccttc ctgctgtccc aggtggcccc 1800 

atccctgttt atgggctcca ttgtccagct cagccagtct gtcactgcct atatggtgtc 1860 

tgccgcaggc ctgggtctgg tcgccattta ctttgctaca caggtagtat ttgacaagag 1920 

cgacttggcc aaatactcag cgtagaaaac ttccagcaca ttggggtgga gggcctgcct 1980 

cactgggtcc cagctccccg ctcctgttag ccccatgggg ctgccgggct ggccgccagt 204 0 

ttctgttgct gccaaagtaa tgtggctctc tgctgccacc ctgtgctgct gaggtgcgta 2100 

gctgcacagc tgggggctgg ggcgtccctc tcctctctcc ccagtctcta gggctgcctg 2160 

actggaggcc ttccaagggg gtttcagtct ggacttatac agggaggcca gaagggctcc 2220 

atgcactgga atgcggggac tctgcaggtg gattacccag gctcagggtt aacagctagc 2280 

ctcctagttg agacacacct agagaagggt ttttgggagc tgaataaact cagtcacctg 2340 

qtttcccatc tctaagcccc ttaacctgca gcttcgttta atgtagctct tgcatgggag 2400 

tttctaggat gaaacactcc tccatgggat ttgaacatat gacttatttg taggggaaga 2460 

gtcctgaggg gcaacacaca agaaccaggt cccctcagcc cacagcactg tctttttgct 2520 

gatccacccc cctcttacct tttatcagga tgtggcctgt tggtccttct gttgccatca 2580 

cagagacaca ggcatttaaa tatttaactt atttatttaa caaagtagaa gggaatccat 2640 

tgctagcttt tctgtgttgg tgtctaatat ttgggtaggg tgggggatcc ccaacaatca 2700 

ggtcccctga gatagctggt cattgggctg atcattgcca gaatcttctt ctcctggggt 2760 

ctggcccccc aaaatgccta acccaggacc ttggaaattc tactcatccc aaatgataat 2820 

tccaaatgct gttacccaag gttagggtgt tgaaggaagg tagagggtgg ggcttcaggt 2880 

ctcaacggct tccctaacca cccctcttct cttggcccag cctggttccc cccacttcca 2940 

ctcccctcta ctctctctag gactgggctg atgaaggcac tgcccaaaat ttcccctacc 3000 

cccaactttc ccctaccccc aactttcccc accagctcca caaccctgtt tggagctact 3060 

gcaggaccag aagcacaaag tgcggtttcc caagcctttg tccatctcag cccccagagt 3120 

atat.ctgtgc ttggggaatc tcacacagaa actcaggagc accccctgcc tgagctaagg 3180 
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gaggtcttat ctctcagggg gggtttaagt gccgtttgca ataatgtcgt cttatttatt 3240 

tagcggggtq aatattttat actgtaagtg agcaatcaga gtataatgtt tatggtgaca 3300 

aaattaaagg ctttcttata tgtttaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3360 

aaaaaaaara aaaaaaaaaa aaaaaaaaaa aaaaaaataa aaaaaaaaaa 3410 



<210> 111 

<211> 1289 

<212> DNA 

<213> Homo sapien 



<400> 111 

agccaggcgt ccctctgcct gcccactcag tggcaacacc cgggagctgt tttgtccttt 60 

gtggagcctc agcagttccc tctttcagaa ctcactgcca agagccctga acaggagcca 120 

ccatgcagtg cttcagcttc attaagacca tgatgatcct cttcaatttg ctcatctttc 180 

tgtgtggtgc agccctgttg gcagtgggca tctgggtgtc aatcgatggg gcatcctttc 240 

tgaagatctt cqggccactg tcgtccagtg ccatgcagtt tgtcaacgtg ggctacttcc 300 

tcatcgcagc cggcgttgtg gtctttgctc ttggtttcct gggctgctat ggtgctaaga 360 

ctgagagcaa gtgtgccctc gtgacgttct tcttcatcct cctcctcatc ttcattgctg 420 

aggttgcagc tgctgtggtc gccttggtgt acaccacaat ggctgagcac ttcctgacgt 480 

tgctggtagt gcctqccatc aagaaagatt atggttccca ggaagacttc actcaagtgt 540 

ggaacaccac catgaaaqgg ctcaagtgct gtggcttcac caactatacg gattttgagg 600 

actcacccta cttcaaagag aacagtgcct ttcccccatt ctgttgcaat gacaacgtca 660 

ccaacacagc caatgaaacc tgcaccaagc aaaaggctca cgaccaaaaa gtagagggtt 720 

gcttcaatca gcttttgtat gacatccgaa ctaatgcagt caccgtgggt ggtgtggcag 780 

ctggaattgg gggcctcgag ctggctgcca tgattgtgtc catgtatctg tactgcaatc 840 

tacaataagt ccacttctgc ctctgccact actgctgcca catgggaact gtgaagaggc 900 

accctggcaa gcagcagtga ttgggggagg ggacaggatc taacaatgtc acttgggcca 960 

gaatggacct gccctttctg ctccagactt ggggctagat agggaccact ccttttagcg 1020 

atgcctgact ttccttccat tggtgggtgg atgggtgggg ggcattccag agcctctaag 1080 

gtagccagtt ctgttgccca ttcccccagt ctattaaacc cttgatatgc cccctaggcc 1140 

tagtggtgat cccagtgctc tactggggga tgagagaaag gcattttata gcctgggcat 1200 

aagtgaaatc agcagagcct ctgggtggat gtgtagaagg cacttcaaaa tgcataaacc 1260 

tgttacaatg ttaaaaaaaa aaaaaaaaa 1289 



<210> 112 
<211> 315 
<212> PRT 
<213> Homo sapien 



<400> 112 



Met 


Val 


Phe 


Thr 


Val 


Arg 


Leu 


Leu 


His 


He 


Phe 


Thr 


Val 


Asn 


Lys 


Gin 


1 








5 










10 










15 




Leu 


Gly 


Pro 


Lys 
20 


lie 


Val 


He 


Val 


Ser 
25 


Lys 


Met 


Met 


Lys 


Asp 
30 


Val 


Phe 


Phe 


Phe 


Leu 
35 


Phe 


Phe 


Leu 


Gly 


Val 
40 


Trp 


Leu 


Val 


Ala 


Tyr 
45 


Gly 


Val 


Ala 


Thr 


Glu 
50 


Gly 


Leu 


Leu 


Arg 


Pro 
55 


Arg 


Asp 


Ser 


Asp 


Phe 
60 


Pro 


Ser 


lie 


Leu 


Arg 


Arg 


Val 


Phe 


Tyr 


Arg 


Pro 


Tyr 


Leu 


Gin 


He 


Phe 


Gly 


Gin 


He 


Pro 


65 










70 










75 










80 


Gin 


Glu 


Asp 


Met 


Asp 

85 


Val 


Ala 


Leu 


Met 


Glu 

90 


His 


Ser 


Asn 


Cys 


Ser 
95 


Ser 


Glu 


Pro 


Gly 


Phe 
100 


Trp 


Ala 


His 


Pro 


Pro 
105 


Gly 


Ala 


Gin 


Ala 


Gly 
110 


Thr 


Cys 


Val 


Ser 


Gin 
115 


Tyr 


Ala 


Asn 


Trp 


Leu 
120 


Val 


Val 


Leu 


Leu 


Leu 
125 


Val 


He 


Phe 


Leu 


Leu 
130 


Val 


Ala 


Asn 


He 


Leu 
135 


Leu 


Val 


Asn 


Leu 


Leu 
140 


He 


Ala 


Met 


Phe 
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Ser Tyr Thr Phe Gly Lys Val Gin Gly Asn Ser Asp Leu Tyr Trp Lys 
145 150 155 160 

Ma Gin Arg Tyr Arq Leu lie Arg Glu Phe His Ser Arg Pro Ala Leu 

165 170 175 

Ala Pro Pro Phe lie Val lie Ser His Leu Arg Leu Leu Leu Arg Gin 

180 185 190 

i.eu Cys Arg Arg Pro Arg Ser Pro Gin Pro Ser Ser Pro Ala Leu Glu 

195 200 205 

His Phe Arg Val Tyr Leu Ser Lys Glu Ala Glu Arg Lys Leu Leu Thr 

210 215 220 

Trc. Glu Ser Val His Lys Glu Asn Phe Leu Leu Ala Arg Ala Arg Asp 
225 230 235 240 

Lys Arg Glu Ser Asp Ser Glu Arg Leu Lys Arg Thr Ser Gin Lys Val 

245 250 255 

Asp Leu Ala Leu Lys Gin Leu Gly His He Arg Glu Tyr Glu Gin Arg 

260 265 270 

Leu Lys Val Leu Glu Arg Glu Val Gin Gin Cys Ser Arg Val Leu Gly 

275 " 280 285 

Trp Val Ala Glu Ala Leu Ser Arg Ser Ala Leu Leu Pro Pro Gly Gly 

290 295 300 

Pro Pro Pro Pro Asp Leu Pro Gly Ser Lys Asp 
30S 310 315 

<210> 113 

<211> 553 

<212> PRT 

<213> Homo sapien 

<400> 113 

Met Val Gin Arg Leu Trp Val Ser Arg Leu Leu Arg His Arg Lys Ala 

15 10 15 

GIt. Leu Leu Leu Val Asn Leu Leu Thr Phe Gly Leu Glu Val Cys Leu 

20 25 30 

Aid /Via Gly He Thr Tyr Val Pro Pro Leu Leu Leu Glu Val Gly Val 

35 40 45 

Glu Glu Lys Phe Met Thr Met Val Leu Gly He Gly Pro Val Leu Gly 

50 55 60 

Leu Val Cys Val Pro Leu Leu Gly Ser Ala Ser Asp His Trp Arg Gly 
65 7 0 75 8 0 

Arg Tyr Gly Arg Arg Arg Pro Phe He Trp Ala Leu Ser Leu Gly He 

85 90 95 

Leu Leu Ser Leu Phe Leu He Pro Arg Ala Gly Trp Leu Ala Gly Leu 

100 105 110 

Leu Cys Pro Asp Pro Arg Pro Leu Glu Leu Ala Leu Leu 11c Leu Gly 

115 120 125 

Val Gly Leu Leu Asp Phe Cys Gly Gin Val Cys Phe Thr Pro Leu Glu 

130 135 140 

Ala Leu Leu Ser Asp Leu Phe Arg Asp Pro Asp His Cys Arg Gin Ala 
145 150 155 160 

Tyr Ser Val Tyr Ala Phe Met He Ser Leu Gly Gly Cys Leu Gly Tyr 

165 170 175 

Leu Leu Pro Ala He Asp Trp Asp Thr Ser Ala Leu Ala Pro Tyr Leu 

180 185 190 

Gly Thr Gin Glu Glu Cys Leu Phe Gly Leu Leu Thr Leu He Phe Leu 

195 200 205 

Thr Cys Val Ala Ala Thr Leu Leu Val Ala Glu Glu Ala Ala Leu Gly 

210 215 220 

Pro Thr Glu Pro Ala Glu Gly Leu Ser Ala Pro Ser Leu Ser Pro His 
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225 

Cys Cys Pro Cys 

Leu Pro Arg Leu 
260 

Arg Leu Phe Val 

275 

Thr Leu Phe Tyr 
290 

Pro Arg Ala Glu 
305 

Val Arg Met Gly 

Val Phe Ser Leu 
340 

Ala Val Tyr Leu 
355 

Thr Cys Leu Ser 
370 

Thr Gly Phe Thr 
385 

Ser Leu Tyr His 

Asp Thr Gly Gly 
420 

Pro Gly Pro Lys 
435 

Gly Gly Ser Gly 
450 

Ala Cys Asp Val 
4 65 

Arg Val Val Pro 

Ser Ala Phe Leu 
500 

lie Val Gin Leu 
515 

Gly Leu Gly Leu 
530 

Lys Ser Asp Leu 
545 





o Q n 






Arg 


Ax a 


Arg 


Leu 


41 D 








nlS 


Cain 


Leu 


Cys 


Ala 


Glu 


Leu 


Cys 








280 


Thr 


Asp 


Phe 


Val 






o o c 




Pro 


bly 


Thr 


Glu 




OTA 

310 






Ser 


Leu 


Gly 


Leu 


O O c: 








vax 


Met 


Asp 


Arg 


Ala 


Ser 


Val 


Ala 








360 


His 


Ser 


Val 


Ala 






375 




Phe 


Ser 


Ala 


Leu 




390 






Arg 


Glu 


Lys 


Gin 


405 








Ala 


Ser 


Ser 


Glu 


Pro 


Gly Ala 


Pro 








440 


Leu 


Leu 


Pro 


Pro 






455 




Ser 


Val 


Arg 


Val 




470 






Gly 


Arg 


Gly 


He 


4 85 








Leu 


Ser 


Gin 


Val 


Ser 


Gin 


Ser 


Val 








520 


Val 


Ala 


He 


Tyr 






535 




Ala 


Lys 


Tyr 


Ser 




550 







235 

Ala Phe Arg Asn 
250 

Cys Arg Met Pro 
265 

Ser Trp Met Ala 

Gly Glu Gly Leu 
300 

Ala Arg Arg His 
315 

Phe Leu Gin Cys 
330 

Leu Val Gin Arg 
345 

Ala Phe Pro Val 

Val Val Thr Ala 
380 

Gin He Leu Pro 
395 

Val Phe Leu Pro 
410 

Asp Ser Leu Met 
425 

Phe Pro Asn Gly 

Pro Pro Ala Leu 
4 60 

Val Val Gly Glu 
475 

Cys Leu Asp Leu 
490 

Ala Pro Ser Leu 
505 

Thr Ala Tyr Met 

Phe Ala Thr Gin 
540 

Ala 



240 

Leu Gly Ala Leu 
255 

Arg Thr Leu Arg 
270 

Leu Met Thr Phe 
285 

Tyr Gin Gly Val 

Tyr Asp Glu Gly 
320 

Ala He Ser Leu 
335 

Phe Gly Thr Arg 
350 

Ala Ala Gly Ala 
365 

Ser Ala Ala Leu 

Tyr Thr Leu Ala 
400 

Lys Tyr Arg Gly 
415 

Thr Ser Phe Leu 
430 

His Val Gly Ala 
445 

Cys Gly Ala Ser 

Pro Thr Glu Ala 
480 

Ala He Leu Asp 
49b 

Phe Met Gly Ser 
510 

Val Ser Ala Ala 
525 

Val Val Phe Asp 



<210> 114 
<211> 241 
<212> PRT 
<213> Homo sapien 



<400> 114 
Met Gin Cys Phe Ser 

1 5 
Leu He Phe Leu Cys 
20 

Ser He Asp Gly Ala 
35 

Ser Ala Met Gin Phe 
50 

Val Val Val Phe Ala 
65 



Phe He Lys Thr Met Met 
10 

Gly Ala Ala Leu Leu Ala 

25 

Ser Phe Leu Lys He Phe 
40 

Val Asn Val Gly Tyr Phe 
55 

Leu Gly Phe Leu Gly Cys 
70 75 



He Leu Phe Asn Leu 
15 

Val Gly He Trp Val 
30 

Gly Pro Leu Ser Ser 
45 

Leu He Ala Ala Gly 
60 

Tyr Gly Ala Lys Thr 
80 
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Glu Ser Lys Cys Ala Leu Val Thr Phe Phe Phe lie Leu Leu Leu lie 

85 90 95 

Phe Ixe Ala Glu Val Ala Ala Ala Val Val Ala Leu Val Tyr Thr Thr 

100 105 110 

Mer Ala Glu His Phe Leu Thr Leu Leu Val Val Pro Ala lie Lys Lys 

115 120 125 

A:^p Tyr Gly Ser Gin Glu Asp Phe Thr Gin Val Trp Asn Thr Thr Met 

130 135 140 

Lys Gly Leu Lys Cys Cys Gly Phe Thr Asn Tyr Thr Asp Phe Glu Asp 
145 150 155 160 

Ser Pro Tyr Phe Lys Glu Asn Ser Ala Phe Pro Pro Phe Cys Cys Asn 

165 170 175 

Asp Asn Val Thr Asn Thr Ala Asn Glu Thr Cys Thr Lys Gin Lys Ala 

180 185 190 

His Asp Gin Lys Val Glu Gly Cys Phe Asn Gin Leu Leu Tyr Asp lie 

195 200 205 

Arg Thr Asn Ala Val Thr Val Gly Gly Val Ala Ala Gly lie Gly Gly 

210 215 220 

Leu Glu Leu Ala Ala Met lie Val Ser Met Tyr Leu Tyr Cys Asn Leu 
225 230 235 240 

Gin 



<210> 115 
<211> 366 
<212> DNA 
<213> HorTLO sapien 



<400> 115 

gctctttctc tcccctcctc tgaatttaat tctttcaact tgcaatttgc aaggattaca 60 

::atttcactg tgatgtatat tgtgttgcaa aaaaaaaaaa gtgtctttgt ttaaaattac 120 

^tqqtttgtg aatccatctt gctttttccc cattggaact agtcattaac ccatctctga 180 

acTCfgtagaa aaacatctga agagctagtc tatcagcatc tgacaggtga attggatggt 240 

tctcagaacc atttcaccca gacagcctgt ttctatcctg tttaataaat tagtttgggt 300 

tctctacatg cataacaaac cctgctccaa tctgtcacat aaaagtctgt gacttgaagt 360 

ttaqtc 366 



<210> 116 
<211> 282 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (282) 
<223> n = A, T,C or G 



<400> 116 

acaaagatga accatttcct atattatagc aaaattaaaa tctacccgta ttctaatatt 60 

gagaaatgag atnaaacaca atnttataaa gtctacttag agaagatcaa gtgacctcaa 120 

agactttact attttcatat tttaagacac atgatttatc ctattttagt aacctggttc 180 

atacgttaaa caaaggataa tgtgaacagc agagaggatt tgttggcaga aaatctatgt 240 

1 'Jciutctnga actatctana tcacagacat ttctattcct tt 282 



<210> 117 
<211> 305 

V JL ^ IJl^^Ti. 

<213> Homo sapien 
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<220> 

<221> mis cofeature 
<222> (305) 
<223> n - A, T,C or G 



<400> 117 

acacatgtcg cttcactgcc ttcttagatg cttctggtca acatanagga acagggacca 60 

tatttatcct ccctcctgaa acaattgcaa aataanacaa aatatatgaa acaattgcaa 120 

aataaggcaa aatatatgaa acaacaggtc tcgagatatt ggaaatcagt caatgaagga 180 

tactgatccc tgatcactgt cctaatgcag gatgtgggaa acagatgagg tcacctctgt 240 

gactgcccca gcttactgcc tgtagagagt ttctangctg cagttcagac agggagaaat 300 

tgggt 305 

<210> 118 

<211> 71 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_feature 
<222> (1) . . . (71) 
<223> n = A, T,C or G 



<400> 118 

accaaggtgt ntgaatctct gacgtgggga tctctgattc ccgcacaatc tgagtggaaa 60 
aantcctggg t y-^ 

<210> 119 
<211> 212 
<212> DNA 
<213> Homo sapien 



<220> 

<221> misc_feature 
<222> (1) . . . (212) 
<223> n = A, T,C or G 



<400> 119 

actccggttg gtgtcagcag cacgtggcat tgaacatngc aatgtggagc ccaaaccaca 60 

gaaaatgggg tgaaattggc caactttcta tnaacttatg ttggcaantt tgccaccaac 120 

agtaagctgg cccttctaat aaaagaaaat tgaaaggttt ctcactaanc ggaattaant 180 

aatggantca aganactccc aggcctcagc gt 212 

<210> 120 

<211> 90 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_feature 
<222> (1) . . . (90) 
<223> n = A,T,C or G 



<400> 120 

actcgttgca natcaggggc cccccagagt caccgttgca ggagtccttc tggtcttgcc 60 
ctccgccggc gcagaacatg ctggggtggt 90 
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<.-10> 121 
<2il> 218 
<212> DNA 
<213> Komo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (218) 
<223> n = A, T, C or G 



<:400> 121 

t cr ::a Ticatga anacgacaga nagggttgtc aaaaatggag aanccttgaa gtcattttga 60 

qaataagatt tgctaaaaga tttggggcta aaacatggtt attggqagac atttctgaag 120 

ataVncangt aaattangga atgaattcat ggttcttttg ggaattcctt tacgatngcc 180 

agcatanact tcatgtgggg atancagcta cccttgta 218 



<210> 122 

<211> 171 

<212> DNA 

<213> Homo sapien 



<40Q> 122 

liaggggtgta tgcaactgta aggacaaaaa ttgagacuca actggcttaa ccaataaagg 60 

■::atttgttag ctcatggaac aggaagtcgg atggtggggc atcttcagtg ctgcatgagt 120 

caccaccccg gcggggtcat ctgtgccaca ggtccctgtt gacagtgcgg t 171 



<210> 123 

<211> 76 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc^feature 
<222> (1) . . . (76) 
<223> n - A, T,C or G 

<400> 123 

tgtagcgtga agacnacaga atggtgtgtg ctgtgctatc caggaacaca tttattatca 60 
I'.tatcaanta ttgtgt 7 6 

<210> 124 
<211> 131 
<212> DNA 
<213> Homo sapien 



<400> 124 

acctttcccc aaggccaatg tcctgtgtgc taactggccg gctgcaggac agctgcaatt 60 

caatgtgctg ggtcatatgg aggggaggag actctaaaat agccaatttt attctcttgg 120 

ttaagatttg t 131 



<210> 125 
<211> 432 
<212> DNA 
<213> Homo sapien 

<400> 125 

ac'.tttatcta ctggctatga aatagatggt ggaaaattgc qttaccaact ataccactgg 60 
cttgaaaaag aggtgatagc tcttcagagg acttgtgact tttgctcaga tgctgaagaa 120 
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ctacagtctg catttggcag aaatgaagat gaatttggat taaatgagga tgctgaagat 180 

ttgcctcacc aaacaaaagt gaaacaactg agagaaaatt ttcaggaaaa aagacagtgg 240 

ctcttgaagt atcagtcact tttgagaatg tttcttagtt actgcatact tcatggatcc 300 

catggtgggg gtcttgcatc tgtaagaatg gaattgattt tgcttttgca agaatctcag 360 

caggaaacat cagaaccact attttctagc cctctgtcag agcaaacctc agtgcctctc 420 

ctctttgctt gt ^39 

<210> 126 
<211> 112 
<212> DNA 
<213> Homo sapien 

<400> 126 

acacaacttg aatagtaaaa tagaaactga gctgaaattt ctaattcact ttctaaccat 60 
agtaagaatg atatttcccc ccagggatca ccaaatattt ataaaaattt gt 112 

<210> 127 

<211> 54 

<212> DNA 

<213> Homo sapien 

<400> 127 

accacgaaac cacaaacaag atggaagcat caatccactt gccaagcaca gcag 54 

<210> 128 
<211> 323 
<212> DNA 
<213> Homo sapien 

<400> 128 

acctcattag taattgtttt gttgtttcat ttttttctaa tgtctcccct ctaccagctc 60 

acctgagata acagaatgaa aatggaagga cagccagatt tctcctttgc tctctgctca 120 

^ ^^^^^a-^^^ c^v^^^a I, t- ^(-y yyyctuc^^ci t: t aLaygcaara aacacagrtc idU 

ccaaagcatt tggacagttt cttgttgtgt tttagaatgg ttttcctttt tcttagcctt 240 

ttcctgcaaa aggctcactc agtcccttgc ttgctcagtg gactgggctc cccagggcct 300 

aggctgcctt cttttccatg tec 323 

<210> 129 
<211> 192 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (192) 
<223> n = A,T,C or G 

<400> 129 

acatacatgt gtgtatattt ttaaatatca cttttgtatc actctgactt tttagcatac 60 

tgaaaacaca ctaacataat ttntgtgaac catgatcaga tacaacccaa atcattcatc 120 

tagcacattc atctgtgata naaagatagg tgagtttcat ttccttcacg ttggccaatg 180 

gataaacaaa gt 292 

<210> 130 
<211> 362 
<212> DNA 
<213> Homo sapien 
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<22Q> 

<221> misc_ feature 
<222> (1} . (362) 
<223> n - A,T,C or G 



<400> 130 

ccctttttta tggaatgagt agactgtatg tttgaanatt tanccacaac ctctttgaca 60 

.a::iatgacg caacaaaaag gtgctgttta gtcctatggt tcagtttatg cccctgacaa 120 

atttccattg tgttrtgccg atcttctggc taatcgtggt atcctccatg ttattagtaa 180 

t:tctgtattc cattttgtta acgcctggta gatgtaacct gctangaggc taactttata 240 

cLtatttaaa agctcttatt ttgtggtcat taaaatggca atttatgtgc agcactttat 300 

Tigcagcagga agcacgtgtg ggttggttgt aaagctcttt gctaatctta aaaagtaatg 360 

gq 362 



<210> 131 
<211> 332 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (332) 
<223> n - A, T, C or G 



<400> 131 

ctttttgaaa gatcgtgtcc actcctgtgg acatcttgtt ttaatggagt ttcccatgca 60 

gtangactgg tatggttgca gctgtccaga taaaaacatt tgaagagctc caaaatgaga 120 

gttctcccag gttcgccctg ctgctccaag tctcagcagc agcctctttt aggaggcatc 180 

ttctgaacta gattaaggca gcttgtaaat ctgatgtgat ttggtttatt atccaactaa 240 

cttccatctg ttatcactgg agaaagccca gactccccan gacnggtacg gattgtgggc 300 

atanaaggat tgggtgaagc tggcgttgtg gt 332 



<210> 132 
<211> 322 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (322) 
<223> n = A,T,C or G 



<400> 132 

acttttgcca ttttgtatat ataaacaatc ttgggacatt ctcctgaaaa ctaggtgtcc 60 

.igtggctaag agaactcgat ttcaagcaat tctgaaagga aaaccagcat gacacagaat 120 

ctcaaattcc caaacagggg ctctgtggga aaaatgaggg aggacctttg tatctcgggt 180 

" tl-agcaagt taaaatgaan atgacaggaa aggcttattt atcaacaaag agaagagttg 240 

ggatgcttct aaaaaaaact ttggtagaga aaataggaat gctnaatcct agggaagcct 300 

gtaacaatct acaattggtc ca 322 



<210> 133 
<211> 278 
<212> DNA 
<213> Homo sapien 

^220> 

<221> misc_feature 
<222> (1) ... (278) 
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<223> n = A, T,C or G 



<400> 133 

acaagccttc acaagtttaa ctaaattggg attaatcttt ctgtanttat ctgcataatt 60 

cttgtttttc tttccatctg gctcctgggt tgacaatttg tggaaacaac tctattgcta 120 

ctatttaaaa aaaatcacaa atctttccct ttaagctatg ttnaattcaa actattcctg 180 

ctattcctgt tttgtcaaag aaattatatt tttcaaaata tgtntatttg tttgatgggt 240 

cccacgaaac actaataaaa accacagaga ccagcctg 278 



<210> 134 
<211> 121 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (121) 
<223> n = A, T,C or G 

<400> 134 

gtttanaaaa cttgtttagc tccatagagg aaagaatgtt aaactttgta ttttaaaaca 60 

tgattctctg aggttaaact tggttttcaa atgttatttt tacttgtatt ttgcttttgg 120 

t 121 

<210> 135 
<211> 350 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (350) 
<223> n -= A, T,C or G 



<400> 135 

acttanaacc atgcctagca catcagaatc cctcaaagaa catcagtata atcctatacc 60 

atancaagtg gtgactggtt aagcgtgcga caaaggtcag ctggcacatt acttgtgtgc 120 

aaacttgata cttttgttct aagtaggaac tagtatacag tncctaggan tggtactcca 180 

gggtgccccc caactcctgc agccgctcct ctgtgccagn ccctgnaagg aactttcgct 240 

ccacctcaat caagccctgg gccatgctac ctgcaattgg ctgaacaaac gtttgctgag 300 

ttcccaagga tgcaaagcct ggtgctcaac tcctggggcg tcaactcagt 350 



<210> 136 
<211> 399 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (399) 
<223> n = A,T,C or G 



<400> 136 

tgtaccgtga agacgacaga agttgcatgg cagggacagg gcagggccga ggccagggtt 60 

gctgtgattg tatccgaata ntcctcgtga gaaaagataa tgagatgacg tgagcagcct 120 

gcagacttgt gtctgccttc aanaagccag acaggaaggc cctgcctgcc ttggctctga 180 

cctggcggcc agccagccag ccacaggtgg gcttcttcct tttgtggtga caacncceag 24 0 

aaaactgcag aggcccaggg tcaggtgtna gtgggtangt gaccataaaa caccaggtgc 300 



BNSDOCID <WO 



0151633A2 I 



wo 01/51633 



PC T/l S(H/(I1574 



50 



t. •;•c:■aaa^iac cccggcdaag gccatcccca cctacagcca gcatgcccac tggcgtgatg 
aatacagana OaLgaagcag ccagntgttc rgctgcggt 



360 
399 



<210> 137 
<211> 165 
<212> DNA 
< 2 1 3 > K omo s ap i e n 

<220> 

<221> misc_f eature 
<222> (1) ... (1 65) 
<223> n = A, T, C or G 

<40C> 137 

actggtgtg.i tnggqg;tga tgctggtggt anaagttgan gtgacttcan gatggtgtgt 60 

ggaqgaaqtg t^7tgaacgta gggatgtaga ngttttggcc gtgctaaatg agcttcggga 120 

ttgqctggtc ccdCtqjzgg tcactgtcat tggtggggtt cctgt 165 

<210> 13b 

<211> 3 3b 

<212> UNA 

<213> Homo sapicn 

<22C> 

<221> xis cofeature 
<222> (1) . . . (338) 
<223> n = A,T,C or G 

<400> 138 

actcactgga atgccacatt cacaacagaa tcagaggtct gtgaaaacat taatggctcc 60 

ttaacitctc cagtaagaat cagggacttg aaatggaaac gttaacagcc acatgcccaa 120 

rgctgggcag tctcccatgc cttccacagt gaaagggctt gagaaaaatc acatccaatg 180 

tcatgtgttt ccagccacac caaaaggtgc ttggggtgga gggctggggg catananggt 240 

cangcctcag gaagcctcaa gttccattca gctttgccac tgtacattcc ccatntttaa 300 

aaaaactgat gccttttttt tttttttttg taaaattc 338 

<210> 139 

<211> 382 

<212> DNA 

<213> Homo sapien 

<400> 139 

gggaatcttg gtttttggca tctggtttgc ctatagccga ggccactttg acagaacaaa 60 

gaaagggact tcgagtaaga aggtgattta cagccagcct agtgcccgaa gtgaaggaga 120 

attcaaacag acctcgtcat tcctggtgtg agcctggtcg gctcaccgcc tatcatctgc 180 

atttgcctta ctcaggtgct accggactct ggcccctgat gtctgtagtt tcacaggatg 240 

ccttatttgt cttctacacc ccacagggcc ccctacttct tcggatgtgt ttttaataat 300 

gtcagctatg "cgccccatcc tccttcatgc cctccctccc tttcctacca ctgctgagtg 360 

gcctggaact tgtttaaagt gt 382 

<210> 140 

<211> 200 

<212> DNA 

<213> Homo sapien 

<220> 

<221> nii sc feature 
<222> (1) .7. (200) 
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<223> n = A,T,C or G 



<400> 140 

accaaanctt ctttctgttg tgttngattt tactataggg gtttngcttn ttctaaanat 60 

acttttcatt taacancttt tgttaagtgt caggctgcac tttgctccat anaattattg 120 

ttttcacatt tcaacttgta tgtgtttgtc tcttanagca ttggtgaaat cacatatttt 180 

atattcagca taaaggagaa 200 



<210> 141 
<211> 335 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc feature 
<222> (1) .7. (335) 
<223> n = A,T,C or G 



<400> 141 

actttatttt caaaacactc atatgttgca aaaaacacat agaaaaataa agtttggtgg 60 

gggtgctgac taaacttcaa gtcacagact tttatgtgac agattggagc agggtttgtt 120 

atgcatgtag agaacccaaa ctaatttatt aaacaggata gaaacaggct gtctgggtga 180 

aatggttctg agaaccatcc aattcacctg tcagatgctg atanactagc tcttcagatg 240 

tttttctacc agttcagaga tnggttaatg actanttcca atggggaaaa agcaagatgg 300 

attcacaaac caagtaattt taaacaaaga cactt 335 



<210> 142 
<211> 459 
<212> DNA 
<213> Homo sapien 

<220> 

^'OO-l-^ -: . . 4r J ^_ 

--^i-U.-^ lliJLOO IticlLurt:; 

<222> {!)... (459) 
<223> n = A, T,C or G 

<400> 142 

accaggttaa tattgccaca tatatccttt ccaattgcgg gctaaacaga cgtgtattta 60 

gggttgttta aagacaaccc agcttaatat caagagaaat tgtgaccttt catggagtat 120 

ctgatggaga aaacactgag ttttgacaaa tcttatttta ttcagatagc agtctgatca 18 0 

cacatggtcc aacaacactc aaataataaa tcaaatatna tcagatgtta aagattggtc 24 0 

ttcaaacatc atagccaatg atgccccgct tgcctataat ctctccgaca taaaaccaca 300 

tcaacacctc agtggccacc aaaccattca gcacagcttc cttaactgtg agctgtttga 360 

agctaccagt ctgagcacta ttgactatnt ttttcangct ctgaatagct ctagggatct 42 0 

cagcangggt gggaggaacc agctcaacct tggcgtant 459 

<210> 143 
<211> 140 
<212> DNA 
<213> Homo sapien 

<400> 143 

acatttcctt ccaccaagtc aggactcctg gcttctgtgg gagttcttat cacctgaggg 60 

aaatccaaac agtctctcct agaaaggaat agtgtcacca accccaccca tctccctgag 120 

accatccgac ttccctgtgt 140 

<210> 144 
<211> 164 
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<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> {!)... (164 ) 
<223> n = A, T, C or G 



<400> 144 

dct r caqtaa caacatacaa taacaacatt aagtgtatat tgccatcttt gtcattttct 60 

uL.'t.citacca ctctcccttc tgaaaacaan aatcactanc caatcactta tacaaatttg 120 

aggcaar.taa tccatatttg ttttcaataa ggaaaaaaag atgt 164 



<210> 145 
<211> 303 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (303) 
<223> n = A,T,C or G 



<400> 145 

acgtagacca tccaactttg tatttgtaat ggcaaacatc cagnagcaat tcctaaacaa 60 

actggagggt atttataccc aattatccca ttcattaaca tgccctcctc ctcaggctat 120 

gcaggacagc tatcataagt cggcccaggc atccagatac taccatttgt ataaacttca 180 

gtaggggagt ccatccaagt gacaggtcta atcaaaggag gaaatggaac ataagcccag 240 

tagtaaaatn ttgcttaqct gaaacagcca caaaagactt accgccgtgg tgattaccat 300 

caa 303 



<210> 146 
<211> 327 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (327) 
<223> n = A,T,C or G 



<400> 146 

actgcagctc aattagaagt ggtctctgac tttcatcanc ttctccctgg gctccatgac 60 

actggcctgg agtgactcat tgctctggtt qgttgagaga gctcctttgc caacaggcct 120 

ccaagicagg gctgggattt gtttcctttc cacattctag caacaatatg ctggccactt 180 

cctgaacagg gagggtggga ggagccagca tggaacaagc tgccactttc taaagtagcc 240 

agacttgccc ctgggcctgt cacacctact gatgaccttc tgtgcctgca ggatggaatg 300 

taggggtgag ctgtgtgact ctatggt 327 



<210> 147 
<211> 173 
<212> DMA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (173) 
<223> n - A, T, C or G 
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<400> 147 

acattgtttt tttgagataa agcattgana gagctctcct taacgtgaca caatggaagg 60 

actggaacac atacccacat ctttgttctg agggataatt ttctgataaa gtcttgctgt 120 

atattcaagc acatatgtta tatattattc agttccatgt ttatagccta gtt 173 



<210> 148 

<211> 477 

<212> DNA 

<213> Homo sapien 

<22C> 

<221> misc_feature 

<222> (1) . . . (477) 

<223> n - A,T,C or G 

<400> 148 

acaaccactt tatclcatcg aatttttaac ccaaactcac tcactgtgcc tttctatcct 60 

atgggatata ttatttgatg ctccatttca tcacacatat atgaataata cactcatact 120 

gccctactac ctgctgcaat aatcacattc ccttcctgtc ctgaccctga agccattggg 180 

gtggtcctag tggccatcag tccangcctg caccttgagc ccttgagctc cattgctcac 24 0 

nccancccac ctcaccgacc ccatcctctt acacagctac ctccttgctc tctaacccca 300 

tagattatnt ccaaattcag tcaattaagt tactattaac actctacccg acatgtccag 360 

caccactggt aagccttctc cagccaacac acacacacac acacncacac acacacatat 420 

ccaggcacag gctacctcat cttcacaatc acccctttaa ttaccatgct atggtgg 477 

<210> 149 
<211> 207 
<212> DNA 
<213> Homo sapien 

<400> 149 

^ _^.ci_ ^o-,£j^^„^„^ ^^^^^^^c^i^^ wu^waci^yciy ay^^d^LL-aay dQgy cacagcidC DU 

taacgtattt tagagagcca aggaaggttt ctgtggggag tgggatgtaa ggtggggcct 120 
gatgataaat aagagtcagc caggtaagtg ggtggtgtgg tatgggcaca gtgaagaaca 180 
tttcaggcag agggaacagc agtgaaa 207 

<210> 150 

<211> 111 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc__f eature 
<222> {1)...(111) 
<223> n = A,T,C or G 

<400> 150 

accttgattt cattgctgct ctgatggaaa cccaactatc taatttagct aaaacatggg 60 
cacttaaatg tggtcagtgt ttggacttgt taactantgg catctttggg t 111 

<210> 151 
<211> 196 
<212> DNA 
<213> Homo sapien 

<400> 151 

agcgcggcag gtcatattga acattccaga tacctatcat tactcgatgc tgttgataac 60 
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ou.'r^agatgg ctttgaactc agggtcacca ccagctattg gaccttacta tgaaaaccat 
acataccaac cggaaaaccc ctatcccgca cagcccactg tggtccccac tgtctacgag 
a w t. :cgg ctcagt 



120 
180 
196 



<210> 152 
^-211> 132 
-:212> DNA 
^ 213> Homo sap.ien 

^:400> 152 

a ::ciaca/"tt cacatgtaag aagggagaaa ttcctaaatg taggagaaag ataacagaac 60 

t cy^''--.-x.zt tcatctagtg gtggaaacct gatgctttat gttgacagga atagaaccag 120 

gagqgagrttgt 132 

<210> 153 
<211> 285 
<212> DNA 
<213> Homo sapien 

<220> 

<221> inisc_f eature 

<:222> (1) . . . (285) 

<223> n ^ A, T, C or G 

<400> 153 

acaanaccca nganaggcca ctggccgtgg tgtcatggcc tccaaacatg aaagtgtcag 60 

cttctgctct tatgtcctca tctgacaact ctttaccatt tttatcctcg ctcagcagga 120 

qcacatcaat aaagtccaaa gtcttggact tggccttggc ttggaggaag tcatcaacac 180 

cctggctagt gagggtgcgg cgccgctcct ggatgacggc atctgtgaag tcgtgcacca 240 

gtctgcaggc cctgtggaag cgccgtccac acggagtnag gaatt 285 

<210> 154 
<211> 333 
<212> DNA 
<213> Homo sapien 

<400> 154 

accacagtcc tgttgggcca gggcttcatg accctttctg tgaaaagcca tattatcacc 60 

accccaaatt tttccttaaa tatctttaac tgaaggggtc agcctcttga ctgcaaagac 120 

Gctaagccgg ttacacagct aactcccact ggccctgatt tgtgaaattg ctgctgcctg 180 

attggcacag gagtcgaagg tgttcagctc ccctcctccg tggaacgaga ctctgatttg 240 

agtttcacaa attctcgggc cacctcgtca ttgctcctct gaaataaaat ccggagaatg 300 

gtcaggcctg tctcatccat atggatcttc egg 333 

<210> 155 
<211> 308 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (308) 
223> n = A, T, C or G 

<400> 155 

:n ...M'-j-i^ata ataaaaccca catcacagtg ttgtgtcaaa gatcatcagg gcatggatgg 60 

'.^aa:.gt c;ctt tqgqaactgt aaagtgccta acacatgatc gatgattttt gttataatat 120 

i tga.t r'cjcg gtgcatacaa actctcctgc ctgctcctcc tgggccccag ccccagcccc 180 
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atcacagctc actgctctgt tcatccaggc ccagcatgta gtggctgatt cttcttggct 240 
gcttttagcc tccanaagtt tctctgaagc caaccaaacc tctangtgta aggcatgctg 300 
gccctggt 3qq 

<21G> 156 
<211> 295 
<212> DNA 
<213> Homo sapien 

<400> 156 

accttgctcg gtgcttggaa catattagga actcaaaata tgagatgata acagtgccta 60 

ttattgatta ctgagagaac tgttagacat ttagttgaag attttctaca caggaactga 120 

gaataggaga ttatgtttgg ccctcatatt ctctcctatc ctccttgcct cattctatgt 180 

ctaatatatt ctcaatcaaa taaggttagc ataatcagga aatcgaccaa ataccaatat 240 

aaaaccagat gtctatcctt aagattttca aatagaaaac aaattaacag actat 295 

<210> 157 
<211> 126 
<212> DNA 
<213> Homo sapien 



<400> 157 

acaagtttaa atagtgctgt cactgtgcat gtgctgaaat gtgaaatcca ccacatttct 60 

gaagagcaaa acaaattctg tcatgtaatc tctatcttgg gtcgtgggta tatctgtccc 120 

cttagt 126 



<210> 158 
<211> 442 
<212> DNA 
<213> Homo sapien 

<220> 

<22'1> misc feature 
<222> (1) .7. (442) 
<223> n = A,T,C or G 



<400> 158 

acccactggt cttggaaaca cccatcctta atacgatgat ttttctgtcg tgtgaaaatg 60 

aanccagcag gctgccccta gtcagtcctt ccttccagag aaaaagagat ttgagaaagt 120 

gcctgggtaa ttcaccatta atttcctccc ccaaactctc tgagtcttcc cttaatattt 180 

ctggtggttc tgaccaaagc aggtcatggt ttgttgagca tttgggatcc cagtgaagta 24 0 

natgtttgta gccttgcata cttagccctt cccacgcaca aacggagtgg cagagtggtg 300 

ccaaccctgt tttcccagtc cacgtagaca gattcacagt gcggaattct ggaagctgga 360 

nacagacggg ctctttgcag agccgggact ctgagangga catgagggcc tctgcctctg 420 

tgttcattct ctgatgtcct gt 442 



<210> 159 
<211> 498 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (498) 
<223> n = A,T,C or G 

<400> 159 

acttccaggt aacgttgttg tttccgttga gcctgaactg atgggtgacg ttgtaggttc 60 



BNSDOCID <WO 0151633A2 I > 



wo (n/51633 



PCT/l SOI/01574 



56 



^ ccaacaaga actgaggttg cagagcgggt agggaagagt gctgttccag ttgcacctgg 120 

actgctgtgg actgttgttg attcctcact acggcccaag gttgtggaac tggcanaaag 180 

gtoi gttgtt gganttgagc tcgggcggct gtggtaggtt gtgggctctt caacaggggc 240 

-sltggtg ccgggangtg aangtgttgt gtcacttgag cttggccagc tctqgaaagt 300 

antanattct tcctgaaggc cagcgcttgt ggagctggca ngggtcantg ttgtgtgtaa 360 

L:9aaccagtg ctgctgtggg tgggtgtana tcctccacaa agcctgaagt tatggtgtcn 420 

t-caggtaana atgtggfctc agtgtccctg ggcngctgtg gaaggttgta nattgtcacc 480 

d-.gggaataa gctgtggt 498 

<210> 160 
<211> 380 
<212> DNA 
<213> Homo sapien 

<220> 

<221> rrLisc_f eature 
<222> (1) . . . (380) 
<223> n = A, T, C or G 

<400> 160 

acctgcatcc agcttccctg ccaaactcac aaggagacat caacctctag acagggaaac 60 

aqcttcagga tacttccagg agacagagcc accagcagca aaacaaatat tcccatgcct 120 

ggagcatggc atagaggaag ctganaaatg tggggtctga ggaagccatt tgagtctggc 180 

cactagacat ctcatcagcc acttgtgtga agagatgccc catgacccca gatgcctctc 240 

ccacccttac ctccatctca cacacttgag ctttccactc tgtataattc taacatcctg 300 

gagaaaaatg gcagtttgac cgaacctgtt cacaacggta gaggctgatt tctaacgaaa 360 

cttgtagaat gaagcctgga 380 

<210> 161 
<211> 114 
<212> DNA 
<213> Homo sapien 

<400> 161 

actccacatc ccctctgagc aggcggttgt cgttcaaggt gtatttggcc ttgcctgtca 60 
cactgtccac tggcccctta tccacttggt gcttaatccc tcgaaagagc atgt 114 

<210> 162 
<211> 177 
<212> DNA 
<213> Homo sapien 

<400> 162 

actttctgaa tcgaatcaaa tgatacttag tgtagtttta atatcctcat atatatcaaa 60 

.}t V.t actac tctgataatt ttgtaaacca ggtaaccaga acatccagtc atacagcttt 120 

tggtgatata taacttggca ataacccagt ctggtgatac ataaaactac tcactgt 177 

<210> 163 

<211> 137 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) .7. (137} 
-'.223> n - A, T,C or G 



<400> 163 
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catttataca gacaggcgtg aagacattca cgacaaaaac gcgaaattct atcccgtgac 60 
canagaaggc agctacggct actcctacat cctggcgtgg gtggccttcg cctgcacctt 120 
catcagcggc atgatgt 137 

<210> 164 
<211> 469 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (469) 
<223> n - A,T,C or G 



<400> 164 

cttatcacaa tgaatgttct cctgggcagc gttgtgatct ttgccacctt cgtgacttta 60 

tgcaatgcat catgctattt catacctaat gagggagttc caggagattc aaccaggaaa 120 

tgcatggatc tcaaaggaaa caaacaccca ataaactcgg agtggcagac tgacaactgt 180 

gagacatgca cttgctacga aacagaaatt tcatgttgca cccttgtttc tacacctgtg 240 

ggttatgaca aagacaactg ccaaagaatc ttcaagaagg aggactgcaa gtatatcgtg 300 

gtggagaaga aggacccaaa aaagacctgt tctgtcagtg aatggataat ctaatgtgct 360 

tctagtaggc acagggctcc caggccaggc ctcattctcc tctggcctct aatagtcaat 420 

gattgtgtag ccatgcctat cagtaaaaag atntttgagc aaacacttt 469 



<210> 165 
<211> 195 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (195) 

<223> n A;T,C or G 



<400> 165 

acagtttttt atanatatcg acattgccgg cacttgtgtt cagtttcata aagctggtgg 60 

atccgctgtc atccactatt ccttggctag agtaaaaatt attcttatag cccatgtccc 120 

tgcaggccgc ccgcccgtag ttctcgttcc agtcgtcttg gcacacaggg tgccaggact 180 

tcctctgaga tgagt 195 



<210> 166 
<211> 383 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc^feature 
<222> (1) . . . (383) 
<223> n - A,T,C or G 



<400> 166 

acatcttagt agtgtggcac atcagggggc catcagggtc acagtcactc atagcctcgc 60 

cgaggtcgga gtccacacca ccggtgtagg tgtgctcaat cttgggcttg gcgcccacct 120 

ttggagaagg gatatgctgc acacacatgt ccacaaagcc tgtgaactcg ccaaagaatt 180 

tttgcagacc agcctgagca aggggcggat gttcagcttc agctcctcct tcgtcaggtg 24 0 

gatgccaacc tcgtctangg tccgtgggaa gctggtgtcc acntcaccta caacctgggc 300 

gangatctta taaagaggct ccnagataaa ctccacgaaa cttctctggg agctgctagt 360 

nggggccttt ttggtgaact ttc 383 
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<210> 16'7 
~ 2ll> 247 
212> DNA 
<213> Homo sapien 

<220> 

<221> inisc_f eature 
<222> (1) . . . (247) 
<223> n - A,T,C or G 



<4Q0> 167 

acagagccag accttggcca taaatgaanc agagattaag actaaacccc aagtcganat 60 

tggagoagaa actggagcaa gaagtgggcc tggggctgaa gtagagacca aggccactgc 120 

tatanccata cacagagcca actctcaggc caaggcnatg gttggggcag anccagagac 180 

tcaatctgan tccaaagtgg tggctggaac actggtcatg acanaggcag tgactctgac 240 

tgangtc 247 



<2i0> 168 
<211> 273 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc__f eature 
<222> (1) . . . (273) 
<223> n = A, T, C or G 



<400> 168 

acttctaagt tttctagaag tggaaggatt gtantcatcc tgaaaatggg tttacttcaa 60 

aatcGCtcan ccttgttctt cacnactgtc tatactgana gtgtcatgtt tccacaaagg 120 

gctgacacct gagcctgnat tttcactcat ccctgagaag ccctttccag tagggtgggc 180 

aatt.cccaac ttccttgcca caagcttccc aggctttctc ccctggaaaa ctccagcttg 240 

aqlcccagat acactcatgg gctgccctgg gca 273 



<210> 169 
<211> 431 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc^feature 
<222> (1) . . . (431) 
<223> n = A, T, C or G 



<400> 169 

acagccttgg cttccccaaa ctccacagtc tcagtgcaga aagatcatct tccagcagtc 60 

agctcagacc agggtcaaag gatgtgacat caacagtttc tggtttcaga acaggttcta 120 

ctactgtcaa atgacccccc atacttcctc aaaggctgtg gtaagttttg cacaggtgag 180 

ggcagcagaa agggggtant tactgatgga caccatcttc tctgtatact ccacactgac 240 

cttgccatgg gcaaaggccc ctaccacaaa aacaatagga tcactgctgg gcaccagctc 300 

acgcacatca ctgacaaccg ggatggaaaa agaantgcca actttcatac atccaactgg 360 

aaaqtgatct gatactggat tcttaattac cttcaaaagc ttctgggggc catcagctgc 420 

tcgaacactg a 431 



<210> 170 
<211> 266 
<212> DNA 
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<213> Homo sapien 
<220> 

<221> misc_feature 
<222> (1) . . . (266) 
<223> n = A,T,C or G 

<400> 170 

acctgtgggc tgggctgtta tgcctgtgcc ggctgctgaa agggagttca gaggtggagc 
tcaaggagct ctgcaggcat tttgccaanc ctctccanag canagggagc aacctacact 
ccccgctaga aagacaccag attggagtcc tgggaggggg agttggggtg ggcatttgat 
gtatacttgt cacctgaatg aangagccag agaggaanga gacgaanatg anattggcct 
tcaaagctag gggtctggca ggtgga 

<210> 171 

<211> 1248 

<212> DNA 

<213> Homo sapien 

<220> 

<221> inisc_feature 
<222> (1) . . . (1248) 
<223> n ^ A,T,C or G 



60 
120 
180 
240 
266 



<400 
ggcagccaaa 
ctggtcatgg 
tcagccgcac 
cacagtcttg 
cggcacccag 
gaatccgtgt 
gcggggaact 
gtgctgcagt 
ccgctgtacc 
aacggtgact 
ggaaaagccc 
actgagtgga 
attgaccccc 
ccctcaggcc 
cccagcccct 
ccaggagtcc 
ctcagaccca 
ccaacccntc 
gcggtccaat 
aaccttacca 
aagagaagng 



> 171 
tcataaacgg 
aaaacgaatt 
actgtttcca 
aggccgacca 
agtacaacag 
Gcgagtctga 
cttgcctcgt 
gcgtgaacgt 
accccagcat 
ctggggggcc 
cgtgtggcca 
tagagaaaac 
aaatacatcc 
caggagtcca 
cctccctcag 
agcccctcct 
ggggtccagg 
att ccccaga 
gccacctaga 
gttggttttt 
caaaaaaaaa 



cgaggactgc 
gttctgctcg 
gaagtgagtg 
agagccaggg 
acccttgctc 
caccatccgg 
ttctggctgg 

gttctgcgcc 
cctgatctgc 
agttggcgtg 
cgtccaggcc 
tgcggaagga 
ggcccccagc 
acccaggagt 
ccctcagacc 
cccccaaccc 
cccagaggtc 
ctntccctgt 
catttttngt 
aaaaaaaaaa 



agcccgcact 
ggcgtcctgg 
cagagctcct 
agccagatgg 
gctaacgacc 
agcatcagca 
ggtctgctgg 
tctgaggagg 
ggcggagggc 
aacgggtact 
ccaggtgtct 
agttaactct 
attcaggaat 
ccctcctccc 
ccagaccccc 
caggagtcca 
ctcctccctc 
caggtcccag 
acacagtgcc 
ccctttcccc 
aaaaaaaaaa 



cgcagccctg 
tgcatccgca 
acaccatcgg 
tggaggccag 
tcatgctcat 
ttgcttcgca 
cgaacggcag 
t ctgcagtaa 
aagaccagaa 
tgcagggcct 
acaccaacct 
ggggactggg 
atctgttccc 
tcaaaccaag 
cagcccctcc 
gaccccccag 
agactcagag 
cccctcntcc 
cccttgtggc 
tagatccaga 
aaaaaaaa 



gcaggcggca 
gtgggtgctg 
gctgggcctg 
cctctccgta 
caagttggac 
gtgccctacc 
aatgcctacc 
gctctatgac 
ggactcctgc 
tgtgtctttc 
ctgcaaattc 
aacccatgaa 
agcccctcct 
ggtacagatc 
tccctcagac 
cccctcctcc 
gtccaagccc 
ctcagaccca 
acgttgaccc 
aataaagttt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1248 



<210> 172 
<211> 159 
<212> PRT 
<213> Homo sapien 

<220> 

<221> VARIANT 

<222> (1) . . . (159) 

<223> Xaa = Any Amino Acid 



<400> 172 
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60 



Met 


Val 


Glu 


Ala 


Ser 


Leu 


Ser 


Val 


Arg 


His 


Pro 


Glu 


Tyr 


Asn 


Arg 


Pro 


1 








5 










10 










15 






LiOu 


A.] a 


Asn 
20 


Asp 


Leu 


Met 


Leu 


He 
25 


Lys 


Leu 


Asp 


Glu 


Ser 
30 


Val 


Ser 


Glu 


Ser 


Asp 
35 


Thr 


He 


Arg 


Ser 


He 
40 


Ser 


He 


Ala 


Ser 


Gin 
45 


Cys 


Pro 


Thr 




Gly 


Asn 


Ser 


Cys 


Leu 


Val 


Ser 


Gly 


Trp 


Gly 


Leu 


Leu 


Ala 


Asn 


Gly 




50 










5 5 










60 








Arg 


Met 


Pro 


Thr 


Val 


Leu 


Gin 


Cys 


Val 


Asn 


Val 


Ser 


Val 


Val 


Ser 


Glu 


6 5 










70 










75 










80 


c;i J 


Val 


Cys 


Ser 


Lys 
85 


Leu 


Tyr 


Asp 


Pro 


Leu 
90 


Tyr 


His 


Pro 


Ser 


Met 
95 


Phe 


Cys 


Ala 


Gly 


Gly 
100 


Gly 


Gin 


Xaa 


Gin 


Xaa 

105 


Asp 


Ser 


Cys 


Asn 


Gly 
110 


Asp 


Ser 


Gly 


Gly 


Pro 
115 


Leu 


He 


Cys 


Asn 


Gly 
120 


Tyr 


Leu 


Gin 


Gly 


Leu 
125 


Val 


Ser 


Phe 


Gly 


Lys 
130 


Ala 


Pro 


Cys 


Gly 


Gin 

135 


Val 


Gly 


Val 


Pro 


Gly 
140 


Val 


Tyr 


Thr 


Asn 


jjeu 


Cys 


Lys 


Phe 


Thr 


Glu 


Trp 


He 


Glu 


Lys 


Thr 


Val 


Gin 


Ala 


Ser 




145 










150 










155 













<210> 173 

<211> 1265 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc^f eature 
<222> (1) . . . (1265) 
<223> n = A,T,C or G 



<400> 173 

gqcagcccgc actcgcagcc ctggcaggcg gcactggtca tggaaaacga attgttctgc 60 

■ cyggcgtcc tggtgcatcc gcagtgggtg ctgtcagccg cacactgttt ccagaactcc 120 

tacaccatcg ggctgggcct gcacagtctt gaggccgacc aagagccagg gagccagatg 180 

gtggaggcca gcctctccgt acggcaccca gagtacaaca gacccttgct cgctaacgac 240 

ctcatgctca tcaagttgga cgaatccgtg tccgagtctg acaccatccg gagcatcagc 300 

attgcttcgc agtgccctac cgcggggaac tcttgcctcg tttctggctg gggtctgctg 360 

gcgaacggtg agctcacggg tgtgtgtctg ccctcttcaa ggaggtcctc tgcccagtcg 420 

cgggggctga cccagagctc tgcgtcccag gcagaatgcc taccgtgctg cagtgcgtga 480 

acgtgtcggt ggtgtctgag gaggtctgca gtaagctcta tgacccgctg taccacccca 540 

gcatgttctg cgccggcgga gggcaagacc agaaggactc ctgcaacggt gactctgggg 600 

ggcccctgat ctgcaacggg tacttgcagg gccttgtgtc tttcggaaaa gccccgtgtg 660 

gccaagttgg cgtgccaggt gtctacacca acctctgcaa attcactgag tggatagaga 720 

daaccgtcca ggccagttaa ctctggggac tgggaaccca tgaaattgac ccccaaatac 780 

dtcctgcgga aggaattcag gaatatctgt tcccagcccc tcctccctca ggcccaggag 840 

tccaggcccc cagcccctcc tccctcaaac caagggtaca gatccccagc ccctcctccc 900 

tcagacccag gagtccagac cccccagccc ctcctccctc agacccagga gtccagcccc 960 

tcctccntca gacccaggag tccagacccc ccagcccctc ctccctcaga cccaggggtt 1020 

gaggccccca acccctcctc cttcagagtc agaggtccaa gcccccaacc cctcgttccc 1080 

cagacccaga ggtnnaggtc ccagcccctc ttccntcaga cccagnggtc caatgccacc 1140 

tagattttcc ctgnacacag tgcccccttg tggnangttg acccaacctt accagttggt 1200 

ttttcatttt tngtcccttt cccctagatc cagaaataaa gtttaagaga ngngcaaaaa 1260 

aaaaa 1265 



<210> 174 
<211> 1459 
<212> DNA 



w o 01/51633 



61 



PCT/IISO 1/0 1574 



<213> Homo sapien 
<220> 

<221> misc_f eature 
<222> (1) . . . (1459) 

<223> n - A, C or G 



<400> 174 

ggtcagccgc acactgtttc cagaagtgag tgcagagctc ctacaccatc gggctgggcc 60 

tgcacagtct tgaggccgac caagagccag ggagccagat ggtggaggcc agcctctccg 120 

tacggcaccc agagtacaac agacccttgc tcgctaacga cctcatgctc atcaagttgg 180 

acgaatccgt gtccgagtct gacaccatcc ggagcatcag cattgcttcg cagtgcccta 24 0 

ccgcggggaa ctcttgcctc gtttctggct ggggtctgct ggcgaacggt gagctcacgg 300 

gtgtgtgtct gccctcttca aggaggtcct ctgcccagtc gcgggggctg acccagagct 360 

ctgcgtccca ggcagaatgc ctaccgtgct gcagtgcgtg aacgtgtcgg tggtgtctga 420 

ngaggtctgc antaagctct atgacccgct gtaccacccc ancatgttct gcgccggcgg 480 

sgggcaagac cagaaggact cctgcaacgt gagagagggg aaaggggagg gcaggcgact 54 0 

cagggaaggg tggagaaggg ggagacagag acacacaggg ccgcatggcg agatgcagag 600 

atggagagac acacagggag acagtgacaa ctagagagag aaactgagag aaacagagaa 660 

ataaacacag gaataaagag aagcaaagga agagagaaac agaaacagac atggggaggc 720 

agaaacacac acacatagaa atgcagttga ccttccaaca gcatggggcc tgagggcggt 780 

gacctccacc caatagaaaa tcctcttata acttttgact ccccaaaaac ctgactagaa 84 0 

atagcctact gttgacgggg agccttacca ataacataaa tagtcgattt atgcatacgt 900 

tttatgcatt catgatatac ctttgttgga attttttgat atttctaagc tacacagttc 960 

gtctgtgaat ttttttaaat tgttgcaact ctcctaaaat ttttctgatg tgtttattga 1020 

aaaaatccaa gtataagtgg acttgtgcat tcaaaccagg gttgttcaag ggtcaactgt 1080 

gtacccagag ggaaacagtg acacagattc atagaggtga aacacgaaga gaaacaggaa 1140 

aaatcaagac tctacaaaga ggctgggcag ggtggctcat gcctgtaatc ccagcacttt 1200 

gggaggcgag gcaggcagat cacttgaggt aaggagttca agaccagcct ggccaaaatg 1260 

gtgaaatcct gtctgtacta aaaatacaaa agttagctgg atatggtggc aggcgcctgt 1320 

aatcccagct acttgggagg ctgaggcagg agaattgctt gaatatggga ggcagaggtt 138 0 

gaagtgagtt gagatcacac cactatactc cagctggggc aacagagtaa gactctgtct 144 0 

caaaaaaaaa aaaaaaaaa 1453 



<210> 175 

<211> 1167 

<212> DNA 

<213> Homo sapien 

<220> 

<221> inisc_f eature 
<222> (1) . . . (1167) 
<223> n = A,T,C or G 



<400> 175 

gcgcagccct ggcaggcggc actggtcatg gaaaacgaat tgttctgctc gggcgtcctg 60 

gtgcatccgc agtgggtgct gtcagccgca cactgtttcc agaactccta caccatcggg 120 

c^tgggcctgc acagtcttga ggccgaccaa gagccaggga gccagatggt ggaggccagc 180 

ctctccgtac ggcacccaga gtacaacaga ctcttgctcg ctaacgacct catgctcatc 24 0 

aagttggacg aatccgtgtc cgagtctgac accatccgga gcatcagcat tgcttcgcag 300 

tgccctaccg cggggaactc ttgcctcgtn tctggctggg gtctgctggc gaacggcaga 360 

atgcctaccg tgctgcactg cgtgaacgtg tcggtggtgt ctgaggangt ctgcagtaag 420 

ctctatgacc cgctgtacca ccccagcatg ttctgcgccg gcggagggca agaccagaag 480 

gactcctgca acggtgactc tggggggccc ctgatctgca acgggtactt gcagggcctt 54 0 

gtgtctttcg gaaaagcccc gtgtggccaa cttggcgtgc caggtgtcta caccaacctc 600 

tgcaaattca ctgagtggat agagaaaacc gtccagncca gttaactctg gggactggga 660 

acccatgaaa ttgaccccca aatacatcct gcggaangaa ttcaggaata tctgttccca 720 

gcccctcctc cctcaggccc aggagtccag gcccccagcc cctcctccct caaaccaagg 78 0 



BNSDOCID: <WO 01 51633A2. )_. > 



\\ O (n /5U>33 
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atacagat cc ccagcccctc ctccctcaga cccaggagtc cagacccccc agcccctcnt 84 0 

ccntcagacc caggagtcca gcccctcctc cntcagacgc aggagtccag accccccagc 900 

-'cn^ cnzccg tcagacccaq gggtgcaggc ccccaacccc zcnzccntca gagtcagacg 960 

cc^..:iqcccc caacccctcg ttccccagac ccagaggtnc aggtcccagc ccctcctccc 1020 

tcagacccag cggtccaatg ccacctagan tntccctgta cacagtgccc ccttgtggca 1080 

'j.'ttgaccca accttaccag ttggtttttc attttttgtc cctttcccct agatccagaa 1140 

ataaagtntaagagaaqcgcaaaaaaa 1167 



<210> 176 
<2i:> 205 
<212> PRT 
<213> Homo sapien 

< 2 2 C > 

<221> VAKIA^JT 

<222- ( 1) . . . (205) 

<22 3> Xda Any Amino Acid 



<4 00> 176 



Met 


Glu 




Gi u 


Leu 


Phe 


Cys 


Ser 


Gly 


Val 


Leu 


Val 


His 


Pro 


Gin 


Trp 


1 








5 










10 










15 




Val 




Ser 


A 1 a 

2 0 


Ala 


His 


Cys 


Phe 


Gin 
25 


Asn 


Ser 


Tyr 


Thr 


He 
30 


Gly 


Leu 


Gly 


Leu 


His 

35 


Ser 


Leu 


Glu 


Ala 


Asp 
40 


Gin 


Glu 


Pro 


Gly 


Ser 
45 


Gin 


Met 


Val 


Glu 


Ala 
50 


Ser 


Leu 


Ser 


Val 


Arg 
55 


Kis 


Pro 


Glu 


Tyr 


Asn 
60 


Arg 


Leu 


Leu 


Leu 


Ala 


Asn 


Asp 


Leu 


Met 


Leu 


He 


Lys 


Leu 


Asp 


Glu 


Ser 


Val 


Ser 


Glu 


Ser 


65 










70 










75 










80 


Asp 


Thr 


lie 


Arg 


Ser 
85 


He 


Ser 


He 


Ala 


Ser 
90 


Gin 


Cys 


Pro 


Thr 


Ala 
95 


Gly 


Asn 


Ser 


Cys 


Leu 
100 


Val 


Ser 


Gly 


Trp 


Gly 
105 


Leu 


Leu 


Ala 


Asn 


Gly 
110 


Arg 


Met 


Pro 


Thr 


Val 
115 


Leu 


His 


Cys 


Val 


Asn 
120 


Val 


Ser 


Val 


Val 


Ser 
125 


Glu 


Xaa 


Val 


Cys 


Ser 
130 


Lys 


Leu 


Tyr 


Asp 


Pro 
135 


Leu 


Tyr 


His 


Pro 


Ser 
140 


Met 


Phe 


Cys 


Ala 




Gly 


Gly 


Gin 


Asp 


Gin 


Lys 


Asp 


Ser 


Cys 


Asn 


Gly 


Asp 


Ser 


Gly 


Gly 


145 










150 










155 










160 


Pro 


Leu 


lie 


Cys 


Asn 
165 


Gly 


Tyr 


Leu 


Gin 


Gly 
170 


Leu 


Val 


Ser 


Phe 


Gly 
175 


Lys 


Ala 


Pro 


Cys 


Gly 
180 


Gin 


Leu 


Gly 


Val 


Pro 
185 


Gly 


Val 


Tyr 


Thr 


Asn 
190 


Leu 


Cys 


Lys 


Phe 


Thr 

195 


Glu 


Trp 


He 


Glu 


Lys 
200 


Thr 


Val 


Gin 


Xaa 


Ser 
205 









<210> 177 

<211> 1119 

<212> DNA 

<213> Hc:iTTio sapien 



<400> 177 

qcgcactcgc aqccctggca ggcggcactg gtcatggaaa acgaattgtt ctgctcgggc 60 

gtcctggtgc atccgcagtg ggtgctgtca gccgcacact gtttccagaa ctcctacacc 120 

atcgggctgg gcctgcacag tcttgaggcc gaccaagagc cagggagcca gatggtggag 180 

gccagcctct ccgtacggca cccagagtac aacagaccct tgctcgctaa cgacctcatg 240 

c:tcatcaagt tggacgaatc cgtgtccgag tctgacacca tccggagcat cagcattgct 300 

tcgcagtgcc ctaccgcggg gaactcttgc ctcgtttctg gctggggtct gctggcgaac 360 



wo 01/5K)33 
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gatgctgtga 
caaccctggc 
ctcactgggt 
caccatagtt 
actaaccatg 
cagttatcct 
tgacctacag 
ttcatttct c 
ggtcacaatg 
ctcagtacac 
accacctcag 
gaggtgaggg 
ttaataaaca 



ttgccatcca 
agggttgtac 
gctcactact 
ctccgaagtc 
ccgatgttta 
cactgaattg 
aggtgaggga 
ctgttgtagt 
atgaatgtat 
cagggcaggt 
gactcctgga 
agagggccca 
gaagctgtga 



gtcccagact 
catttcggca 
gctcactgca 
agactatcat 
ggtgaaatta 
agatttcctg 
tcatatagct 
gaaaggtgcg 
gatcgtgttc 
ctagcatttc 
ttctctgcct 
tggttcaatg 
tgttaaaaaa 



gtgggaggct 
acttccagtg 
tcacccggaa 
gattactgtg 
gcgtcacttg 
cttcagtgtc 
cttcaaggat 
ccctctggag 
ccattaccca 
ttcatttagt 
agttgagctc 
ggatctgtgc 
aaaaaaaaa 



gggagtgtga 
caaggacgtc 
cactgtgatc 
ttgactgtgc 
gcctcaacca 
agccattccc 
gctggtactc 
cctcccaggg 
aagcctttaa 
gtatgctgtc 
ctgcatgctg 
agttgtaaca 



gaagctttcc 
ctgctgcatc 
aactagccag 
tgtctattgt 
tcttggtatc 
acataatttc 
ccctcacaaa 
tgggtgtgca 
atccctcatg 
cattcatgca 
cctccttggg 
cattaggtgc 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1119 



<210> 178 
<211> 164 
<212> PRT 
<213> Homo sapien 



<220> 

<221> VARIANT 

<222> (1) . . . (164) 

<223> Xaa = Any Amino Acid 



<400> 178 
Met Glu Asn Glu 
1 

Val Leu Ser Ala 
20 

Gly Leu His Ser 
35 

Glu Ala Ser Leu 
5 0 

Ala Asn Asp Leu 
65 

Asp Thr lie Arg 

Asn Ser Cys Leu 
100 

He Ala He Gin 
115 

Ser Gin Pro Trp 
130 

Thr Ser Cys Cys 
145 

Pro Gly Thr Leu 



Leu 


Phe 


Cys 


Ser 


5 








Ala 


His 


Cys 


Phe 


Leu 


Glu 


Ala 


Asp 








40 


Ser 


Val 


Arg 


His 






55 




Met 


Leu 


lie 


Lys 




70 






Ser 


He 


Ser 


He 


85 








Val 


Ser 


Gly 


Trp 


Ser 


Xaa 


Thr 


Val 








120 


Gin 


Gly Cys 


Thr 






135 




He 


Leu 


Thr 


Gly 




150 







Gly Val Leu Val 
10 

Gin Asn Ser Tyr 

25 

Gin Glu Pro Gly 

Pro Glu Tyr Asn 
60 

Leu Asp Glu Ser 
75 

Ala Ser Gin Cys 
90 

Gly Leu Leu Ala 
105 

Gly Gly Trp Glu 

He Ser Ala Thr 
140 

Cys Ser Leu Leu 
155 



His Pro Gin Trp 
15 

Thr He Gly Leu 
30 

Ser Gin Met Val 
45 

Arg Pro Leu Leu 

Val Ser Glu Ser 
80 

Pro Thr Ala Gly 
95 

Asn Asp Ala Val 
110 

Cys Glu Lys Leu 
125 

Ser Ser Ala Arg 

Leu Thr Ala Ser 
160 



<210> 179 
<211> 250 
<212> DNA 
<213> Homo sapien 

<400> 179 

ctggagtgcc ttggtgtttc aagcccctgc aggaagcaga atgcaccttc tgaggcacct 60 

ccagctgccc ccggccgggg gatgcgaggc tcggagcacc cttgcccggc tgtgattgct 120 

gccaggcact gttcatctca gcttttctgt ccctttgctc ccggcaagcg cttctgctga 180 

aagttcatat ctggagcctg atgtcttaac gaataaaggt cccatgctcc acccgaaaaa 24 0 



BNSDOCID <WO 0151633A2 I > 



wo (H/51(>33 
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PC I/l S(M/01574 



addenda a 250 

^;210> 180 

<212> DNA 

< 2 1 3 > H omo s ap i e n 



<-.-100> 180 

^rt.agi.ccag tgtggtggaa ttccattgtg ttgggcccaa cacaatggct acctttaaca 60 

L c: :...:Gcagac cccgcccctg cccgtgcccc acgctgctgc taacgacagt atgatgctra 120 

^ act cggaaactat ttttatgtaa ttaatgtatg ctttcttgtt tataaatgcc 180 

' urj 1' vdaaa aaaaaaaaaa aa 202 



=-210> 181 
^211> 558 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 

<222> (1) . . . (558) 

<223> n = A,T,C or G 



<400> 181 

Lccytttgkt naggtttkkg agacamccck agacctwaan ctgtgtcaca gacttcyngg 60 

aatgtttagg cagtgctagt aatttcytcg taatgattct gttattactt tcctnattct 120 

ttattcctct ttcttctgaa gattaatgaa gttgaaaatt gaggtggata aatacaaaaa 160 

ggtagtgtga tagtataagt atctaagtgc agatgaaagt gtgttatata tatccattca 240 

aaattatgca agttagtaat tactcagggt taactaaatt actttaatat gctgttgaac 300 

ctactctgtt ccttggctag aaaaaattat aaacaggact ttgttagttt gggaagccaa 360 

attgataata ttctatgttc taaaagttgg gctatacata aattattaag aaatatggaw 420 

ttttattccc aggaatatgg kgttcatttt atgaatatta cscrggatag awgtwtgagt 480 

iaaaycagtt ttggtwaata ygtwaatatg tcmtaaataa acaakgcttt gacttatttc 540 

naaaaaaaaa aaaaaaaa 558 



<210> 182 
<211> 479 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc^feature 
<222> (!}... (479) 
<223> n = A,T,C or G 



<400> 182 

acagggwttk grggatgcta agsccccrga rwtygtttga tccaaccctg gcttwttttc 60 

agaggggaaa atggggccta gaagttacag mscatytagy tggtgcgmtg gcacccctgg 120 

cstcacacag astcccgagt agctgggact acaggcacac agtcactgaa gcaggccctg 180 

ttwgcaattc acgttgccac ctccaactta aacattcttc atatgtgatg tccttagtca 240 

ctaaggttaa actttcccac ccagaaaagg caacttagat aaaatcttag agtactttca 300 

tactmttcta agtcctcttc cagcctcact kkgagtcctm cytgggggtt gataggaant 360 

/itctct.tggc tttctcaata aartctctat ycatctcatg tttaatttgg tacgcatara 420 

awtqstgara aaattaaaat gttctggtty mactttaaaa araaaaaaaa aaaaaaaaa 479 



<210> 183 
<211> 384 
<212> DNA 



w o 01/51633 



PCT/llS01/(n574 



65 



<213> Homo sapien 



<400> 183 

aggcgggagc agaagctaaa gccaaagccc aagaagagtg gcagtgccag cactggtgcc 60 

agtaccagta ccaataacag tgccagtgcc agtgccagca ccagtggtgg cttcagtgct 120 

ggtgccagcc tgaccgccac tctcacattt gggctcttcg ctggccttgg tggagctggt 180 

gccagcacca gtggcagctc tggtgcctgt ggtttctcct acaagtgaga ttttagatat 240 

tgttaatcct gccagtcttt ctcttcaagc cagggtgcat cctcagaaac ctactcaaca 300 

cagcactcta ggcagccact atcaatcaat tgaagttgac actctgcatt aratctattt 360 

gccatttcaa aaaaaaaaaa aaaa 384 



<210> 184 
<211> 496 
<212> DNA 
<213> liomo sapien 

<220> 

<221> niisc_f eature 
<222> (!}... (^.96J 
<223> n - A,T,C or G 



<400> 184 

accgaattgg gaccgctggc ttataagcga tcatgtyynt ccrgtatkac ctcaacgagc 60 

agggagatcq aqtctatacg ctgaagaaat ttgacccgat gggacaacag acctgctcag 120 

cccatcctgc tcggttctcc ccagatgaca aatactctsg acaccgaatc accatcaaga 180 

aacgcttcaa ggtgctcatg acccagcaac cgcgccctgt cctctgaggg tcccttaaac 240 

tgatgtcttt tctgccacct gttacccctc ggagactccg taaccaaact cttcggactg 300 

tgagccctga tgcctttttg ccagccatac tctttggcat ccagtctctc gtggcgattg 360 

attatgcttg tgtgaggcaa tcatggtggc atcacccata aagggaacac atttgacttt 420 

tttttctcat altttaaatt actacmagaw tattwiuagaw waaatgawtt gaaaaactst 480 

taaaaaaaaa aaaaaa aq^; 



<210> 185 
<211> 384 
<212> DNA 
<213> Homo sapien 



<400> 185 

gctggtagcc tatggcgkgg cccacggagg ggctcctgag gccacggrac agtgacttcc 60 

caagtatcyt gcgcsgcgtc ttctaccgtc cctacctgca gatcttcggg cagattcccc 120 

aggaggacat ggacgtggcc ctcatggagc acagcaactg ytcgtcggag cccggcttct 180 

gggcacaccc tcctggggcc caggcgggca cctgcgtctc ccagtatgcc aactggctgg 24 0 

tggtgctgct cctcgtcatc ttcctgctcg tggccaacat cctgctggtc aacttgctca 300 

ttgccatgtt cagttacaca ttcggcaaag tacagggcaa cagcgatctc tactgggaag 360 

gcgcagcgtt accgcctcat ccgg 384 



<210> 186 
<211> 577 
<212> DNA 
<213> Homo sapien 

<220> 

<221> inisc_f eature 

<222> (1) . . . (577) 

<223> n = A,T,C or G 

<400> 186 

gagttagctc ctccacaacc ttgatgaggt cgtctgcagt ggcctctcgc ttcataccgc 60 



BNSDOCID <WC 0151633A2 I > 
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' ! ic ."cit cgt c atactgtagg tttgccacca cytcctggca tcttggggcg gcntaatatt 120 

ccaqgaaact crcaatcaag tcaccgtcga tgaaacctgr gggctggttc tgtcttccgc 180 

':^;-itqtgaa aggatctccc agaaggagtg ctcgatcttc cccacacttt tgatgacttt 240 

. ' ^„ q;:^ gt cga ttctgcatgt ccagcaggag gttgtaccag ctctctgaca gtgaggtcac 300 

caqccctatc atgccgttga mcgtgccgaa garcaccgag ccttgtgtgg gggkkgaagt 360 

::tcacr:caga ttctgcatta ccagagagcc gtqgcaaaag acattgacaa actcgcccag 420 

ntaci^aaaag amcamctcct ggargtgctn gccgctcctc gtcmgttggt ggcagcgctw 4 80 

^.■cttttgac acacaaacaa gttaaaggca ttttcagccc ccagaaantt gtcatcatcc 540 

i^agatntcgc acagcactna tccagttggg attaaat 577 



^-210> 187 
<211> 534 
<212> DNA 
<213> Homo sapien 

<220> 

<221> inisc_f eature 
<222> (1 ) . . . (534 ) 
<223> n = A,T,C or G 



<400> 187 

aacatcttcc tgtataatgc tgtgtaatat cgatccgatn ttgtctgstg agaatycatw 60 

dct.kggaaaa gmaacattaa agcctggaca ctggtattaa aattcacaat atgcaacact 120 

it.aaacagtg tgtcaatctg ctcccyynac tttgtcatca ccagtctggg aakaagggta 180 

tgccctattc acacctgtta aaagggcgct aagcattttt gattcaacat cttttttttt 240 

gacacaagtc cgaaaaaagc aaaagtaaac agttatyaat ttgttagcca attcactttc 300 

ttcatgggac agagccatyt: gatttaaaaa gcaaattgca taatattgag cttygggagc 360 

tgatatttga gcggaagagt agcctttcta cttcaccaga cacaactccc tttcatattg 420 

ggatgttnac naaagtwatg tctctwacag atgggatgct tttgtggcaa ttctgttctg 480 

aggatctccc agtttattta ccacttgcac aagaaggcgt tttcttcctc aggc 534 



<210> 188 
<211> 761 
<212> DNA 
<213> }3omo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (761) 
<223> n = A, T,C or G 



<400> 188 

agaaaccagt atctctnaaa acaacctctc ataccttgtg gacctaattt tgtgtgcgtg 60 

tgtgtgtgcg cgcatattat atagacaggc acatcttttt tacttttgta aaagcttatg 120 

■:c^t ctttggt atctatatct gtgaaagttt taatgatctg ccataatgtc ttggggacct 180 

ttgtcttctg tgtaaatggt actagagaaa acacctatnt tatgagtcaa tctagttngt 240 

tttattcgac atgaaggaaa tttccagatn acaacactna caaactctcc ctkgackarg 300 

ggggacaaag aaaagcaaaa ctgamcataa raaacaatwa cctggtgaga arttgcataa 360 

acagaaatwr ggtagtatat tgaarnacag catcattaaa rmgttwtktt wttctccctt 420 

gcaaaaaaca tgtacngact tcccgttgag taatgccaag ttgttttttt tatnataaaa 480 

cttgcccttc attacatgtt tnaaagtggt gtggtgggcc aaaatattga aatgatggaa 540 

ctqactgata aagctgtaca aataagcagt gtgcctaaca agcaacacag taatgttgac 600 

atgcttaatt cacaaatgct aatttcatta taaatgtttg ctaaaataca ctttgaacta 660 

vttttctgtn ttcccagagc tgagatntta gattttatgt agtatnaagt gaaaaantac 720 

gaaaataata acattgaaga aaaananaaa aaanaaaaaa a 761 



<210> 189 
<211> 482 
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<212> DNA 

<213> Homo sapien 

<220> 

<221> mis cofeature 
<222> (1) . . . (482) 
<223> n = A, T,C or G 



<400> 189 

tttttttttt tttgccgatn ctactatttt attgcaggan gtgggggtgt atgcaccgca 60 

caccggggct atnagaagca agaaggaagg agggagggca cagccccttg ctgagcaaca 120 

aagccgcctg ctgccttctc tgtctgtctc ctggtgcagg cacatgggga gaccttcccc 180 

aaggcagggg ccaccagtcc aggggtggga atacaggggg tgggangtgt gcataagaag 240 

tgataggcac aggccacccg gtacagaccc ctcggctcct gacaggtnga tttcgaccag 300 

gtcattgtgc cctgcccagg cacagcgtan atctggaaaa gacagaatgc tttccttttc 360 

aaatttggct ngtcatngaa ngggcanttt tccaanttng gctnggtctt ggtacncttg 420 

gttcggccca gctccncgtc caaaaantat tcacccnnct ccnaattgct tgcnggnccc 480 



482 

<210> 190 
<211> 471 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (471) 
<223> n = A,T,C or G 



<400> 190 

tttttttttt ttttaaaaca gtttttcaca acaaaattta ttagaagaat agtggttttg 60 

aaaactctcg catccagtga gaactaccat acaccacatt acagctngga atgtnctcca 120 

aatgtctggt caaatgatac aatggaacca ttcaatctta cacatgcacg aaagaacaag 180 

cgcttttgac atacaatgca caaaaaaaaa aggggggggg gaccacatgg attaaaattt 240 

taagtactca tcacatacat taagacacag ttctagtcca gtcnaaaatc agaactgcnt 300 

tgaaaaattt catgtatgca atccaaccaa agaacttnat tggtgatcat gantnctcta 360 

ctacatcnac cttgatcatt gccaggaacn aaaagttnaa ancacncngt acaaaaanaa 420 

tctgtaattn anttcaacct ccgtacngaa aaatnttnnt tatacactcc c 4 71 



<210> 191 
<211> 402 
<212> DNA 
<213> Homo sapien 

<220> 

<221> niisc_f eature 
<222> (1) . . . (402) 
<223> n = A, T,C or G 



<400> 191 

gagggattga aggtctgttc tastgtcggm ctgttcagcc accaactcta acaagttgct 60 

gtcttccact cactgtctgt aagcttttta acccagacwg tatcttcata aatagaacaa 120 

attcttcacc agtcacatct tctaggacct ttttggattc agttagtata agctcttcca 180 

cttcctttgt taagacttca tctggtaaag tcttaagttt tgtagaaagg aattyaattg 240 

ctcgttctct aacaatgtcc tctccttgaa gtatttggct gaacaaccca cctaaagtcc 300 

ctttgtgcat ccattttaaa tatacttaat agggcattgk tncactaggt taaattctgc 360 

aagagtcatc tgtctgcaaa agttgcgtta gtatatctgc ca 402 
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<210> 192 
<211> 601 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc _f eature 
<222> (1) .'. . (601) 
<223> n = A,T,C or G 



<400> 192 

gagctcggat ccaataatct ttgtctgagg gcagcacaca tatncagtgc catggnaact 60 

ggtctacccc acatgggagc agcatgccgt agntatataa ggtcattccc tgagtcagac 120 

atgcytyttt gaytaccgtg tgccaagtgc tggtgattct yaacacacyt ccatcccgyt 180 

cttttgtgga aaaactggca cttktctgga actagcarga catcacttac aaattcaccc 240 

acgagacact tgaaaggtgt aacaaagcga ytcttgcatt gctttttgtc cctccggcac 300 

cagttgtcaa tactaacccg ctggtttgcc tccatcacat ttgtgatctg tagctctgga 360 

tacatctcct gacagtactg aagaacttct tcttttgttt caaaagcarc tcttggtgcc 420 

tgttggatca ggttcccatt tcccagtcyg aatgttcaca tggcatattt wacttcccac 480 

aaaacattgc gatttgaggc tcagcaacag caaatcctgt tccggcattg gctgcaagag 540 

cctcgatgta gccggccagc gccaaggcag gcgccgtgag ccccaccagc agcagaagca 600 

g 601 



<210> 193 
<211> 608 
<212> DNA 
<213> Homo sapien 

<220> 

<221> mis c_f eature 
<222> (1) . . . (608) 
<223> n = A,T,C or G 



<400> 193 

dtacagccca natcccacca cgaagatgcg cttgttgact gagaacctga tgcggtcact 60 

ggtcccgctg tagccccagc gactctccac ctgctggaag cggttgatgc tgcactcytt 120 

cccaacgcag gcagmagcgg gsccggtcaa tgaactccay tcgtggcttg gggtkgacgg 180 

tkaagtgcag gaagaggctg accacctcgc ggtccaccag gatgcccgac tgtgcgggac 240 

ctgcagcgaa actcctcgat ggtcatgagc gggaagcgaa tgaggcccag ggccttgccc 300 

agaaccttcc gcctgttctc tggcgtcacc tgcagctgct gccgctgaca ctcggcctcg 360 

gaccagcgga caaacggcrt tgaacagccg cacctcacgg atgcccagtg tgtcgcgctc 420 

caqqainingsc accagcgtgt ccaggtcaat gtcggtgaag ccctccgcgg gtratggcgt 480 

ct.gcagtgtt tttgtcgatg ttctccaggc acaggctggc cagctgcggt tcatcgaaga 540 

gtcgcgcctg cgtgagcagc atgaaggcgt tgtcggctcg cagttcttct tcaggaactc 600 

cacgcaat 608 



<210> 194 
<211> 392 
<212> DNA 
<213> Homo sapien 

<220> 

<221> inisc_f eature 
<222> (1) . . . (392) 
<223> n - A, T, C or G 

<400> 194 

gaacggctgg accttgcctc gcattgtgct tgctggcagg gaataccttg gcaagcagyt 60 
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ccagtccgag cagccccaga ccgctgccgc ccgaagctaa gcctgcctct ggccttcccc 120 

tccgcctcaa tgcagaacca gtagtgggag cactgtgttt agagttaaga gtgaacactg 180 

tttgatttta cttgggaatt tcctctgtta tatagctttt cccaatgcta atttccaaac 240 

aacaacaaca aaataacatg tttgcctgtt aagttgtata aaagtaggtg attctgtatt 300 

taaagaaaat attactgtta catatactgc ttgcaatttc tgtatttatt gktnctstgg 360 

aaataaatat agttattaaa ggttgtcant cc 392 



<210> 195 
<211> 502 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (502) 
<223> n = A,T,C or G 



<400> 195 

ccsttkgagg ggtkaggkyc cagttyccga gtggaagaaa caggccagga gaagtgcgtg 60 

ccgagctgag gcagatgttc ccacagtgac ccccagagcc stgggstata gtytctgacc 120 

cctcncaagg aaagaccacs ttctggggac atgggctgga gggcaggacc tagaggcacc 180 

aagggaaggc cccattccgg ggstgttccc cgaggaggaa gggaaggggc tctgtgtgcc 240 

ccccasgagg aagaggccct gagtcctggg atcagacacc ccttcacgtg tatccccaca 300 

caaatgcaag ctcaccaagg tcccctctca gtccccttcc stacaccctg amcggccact 360 

gscscacacc cacccagagc acgccacccg ccatggggar tgtgctcaag gartcgcngg 420 

gcarcgtgga catctngtcc cagaaggggg cagaatctcc aatagangga ctgarcmstt 480 

gctnanaaaa aaaaanaaaa aa 502 



<210> 196 
<211> 665 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc__feature 
<222> (1) . . . (665) 
<223> n = A,T,C or G 



<400> 196 

ggttacttgg tttcattgcc accacttagt ggatgtcatt tagaaccatt ttgtctgctc 60 

cctctggaag ccttgcgcag agcggacttt gtaattgttg gagaataact gctgaatttt 120 

wagctgtttk gagttgatts gcaccactgc acccacaact tcaatatgaa aacyawttga 180 

actwatttat tatcttgtga aaagtataac aatgaaaatt ttgttcatac tgtattkatc 240 

aagtatgatg aaaagcaawa gatatatatt cttttattat gttaaattat gattgccatt 300 

attaatcggc aaaatgtgga gtgtatgttc ttttcacagt aatatatgcc ttttgtaact 360 

tcacttggtt attttattgt aaatgartta caaaattctt aatttaagar aatggtatgt 420 

watatttatt tcattaattt ctttcctkgt ttacgtwaat tttgaaaaga wtgcatgatt 480 

tcttgacaga aatcgatctt gatgctgtgg aagtagtttg acccacatcc ctatgagttt 540 

ttcttagaat gtataaaggt tgtagcccat cnaacttcaa agaaaaaaat gaccacatac 600 

tttgcaatca ggctgaaatg tggcatgctn ttctaattcc aactttataa actagcaaan 660 

aagtg 665 



<210> 197 
<211> 492 
<212> DNA 
<213> Homo sapien 

<220> 
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<221> misc_feature 
<222> (1) ... (-192) 
<223> n = A, T, C or G 



<400> 197 

trttriLLttt ttttttttgc aggaaggatt ccatttattg rggatgcatt ttcacaatat 60 

atgtttattg gagcqatcca ttatcagtga aaagtatcaa gtgtttataa natttttaqg 120 

aaggcagatt cacagaacat gctngtcngc ttgcagtttt acctcgtana gatnacagag 180 

aattatagtc naaccagtaa acnaggaatt tacttttcaa aagattaaat ccaaactgaa 240 

caaaattcta ccctgaaact tactccatcc aaatattgga ataanagtca gcagtgatac 300 

3t:t.ctcttct gaactttaga ttttctagaa aaatatgtaa tagtgatcag gaagagctct 360 

Lgttcaaaag tacaacnaag caatgttccc ttaccatagg ccttaattca aactttgatc 420 

catttcactc ccatcacggg agtcaatgct acctgggaca cttgtatttt gttcatnctg 480 

ancntggctt aa 492 



<210> 198 
<211> 478 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (478) 
<223> n - A, T, C or G 



<400> 198 

tttnttttgn atttcantct gtannaanta ttttcattat gtttattana aaaatatnaa 60 

tgtntccacn acaaatcatn ttacntnagt aagaggccan ctacattgta caacatacac 120 

tgagtatatt ttgaaaagga caagtttaaa gtanacncat attgccganc atancacatt 180 

tatacatggc ttgattgata tttagcacag canaaactga gtgagttacc agaaanaaat 240 

natatatgtc aatcngattt aagatacaaa acagatccta tggtacatan catcntgtag 300 

gagttgtggc tttatgttta ctgaaagtca atgcagttcc tgtacaaaga gatggccgta 360 

agcattctag tacctctact ccatggttaa gaatcgtaca cttatgttta catatgtnca 420 

ggqtaagaat tgtgttaagt naanttatgg agaggtccan gagaaaaatt tgatncaa 478 



<210> 199 
<211> 482 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> ( 1) . . , (482) 
<223> n = A,T,C or G 



<400> 199 

agtgacttgt cctccaacaa aaccccttga tcaagtttgt ggcactgaca atcagaccta 60 

tgctagttcc tgtcatctat tcgctactaa atgcagactg gaggggacca aaaaggggca 120 

tcaactccag ctggattatt ttggagcctg caaatctatt cctacttgta cggactttga 180 

agtgattcag tttcctctac ggatgagaga ctggctcaag aatatcctca tgcagcttta 240 

tgaagccnac tctgaacacg ctggttatct nagatgagaa ncagagaaat aaagtcnaga 300 

aaatttacct ggangaaaag aggctttngg ctggggacca tcccattgaa ccttctctta 360 

anggacttta agaanaaact accacatgtn tgtngtatcc tggtgccngg ccgtttantg 420 

aacntngacn ncacccttnt ggaatanant cttgacngcn tcctgaactt gctcctctgc 480 

qa 482 



<210> 200 
<211> 270 
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<212> DNA 

<213> Homo sapien 

<220> 

<221> iaisc_f eature 
<222> (1) . . . (270) 
<223> n = A,T,C or G 



<400> 200 

cggccgcaag tgcaactcca gctggggccg tgcggacgaa gattctgcca gcagttggtc 60 

cgactgcgac gacggcggcg gcgacagtcg caggtgcagc gcgggcgcct ggggtcttgc 120 

aaggctgagc tgacgccgca gaggtcgtgt cacgtcccac gaccttgacg ccgtcgggga 180 

cagccggaac agagcccggt gaangcggga ggcctcgggg agcccctcgg gaagggcggc 24 0 

ccgagagata cgcaggtgca ggtggccgcc 270 



<210> 201 
<211> 419 
<212> DNA 
<213> Homo sapien 

<220> 

<221> inisc_f eature 
<222> (1) . . . (419) 
<223> n = A, T,C or G 



<400> 201 

tttttttttt ttttggaatc tactgcgagc acagcaggtc agcaacaagt ttattttgca 60 

gctagcaagg taacagggta gggcatggtt acatgttcag gtcaacttcc tttgtcgtgg 120 

ttgattggtt tgtctttatg ggggcggggt ggggtagggg aaancgaagc anaantaaca 180 

tggagtgggt gcaccctccc tgtagaacct ggttacnaaa gcttggggca gttcacctgg 24 0 

tctgtgaccg tcattttctt gacatcaatg ttattagaag tcaggatatc ttttagagag 300 

tccactgtnt ctggagggag attagggttt cttgccaana tccaancaaa atccacntga 360 

aaaagt"-t'. gga "tga'tncangt acngaatacc ganggcatan ttctcatant cgg'^ggcca 419 



<210> 202 
<211> 509 
<212> DNA 
<213> Homo sapien 

<220> 

<221> inisc_f eature 
<222> (1) . . . (509) 
<223> n = A, T,C or G 



<400> 202 

tttntttttt tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt 60 

tggcacttaa tccattttta tttcaaaatg tctacaaant ttnaatncnc cattatacng 120 

gtnattttnc aaaatctaaa nnttattcaa atntnagcca aantccttac ncaaatnnaa 180 

tacncncaaa aatcaaaaat atacntntct ttcagcaaac ttngttacat aaattaaaaa 240 

aatatatacg gctggtgttt tcaaagtaca attatcttaa cactgcaaac atntttnnaa 300 

ggaactaaaa taaaaaaaaa cactnccgca aaggttaaag ggaacaacaa attcntttta 360 

caacancnnc nattataaaa atcatatctc aaatcttagg ggaatatata cttcacacng 420 

ggatcttaac ttttactnca ctttgtttat ttttttanaa ccattgtntt gggcccaaca 4 80 

caatggnaat nccnccncnc tggactagt 509 



<210> 203 
<211> 583 
<212> DNA 
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<213> Homo sapien 
<220> 

<221> misc^f eature 
<222> (1) . . . (583) 
<223> n - A, T, C or G 



<400> 203 

ttrttttttt ttttttttga cccccctctt ataaaaaaca agttaccatt ttattttact 60 

tacacatatt tattttataa ttggtattag atattcaaaa ggcagctttt aaaatcaaac 120 

r.aaatggaaa ctgccttaga tacataattc ttaggaatta gcttaaaatc tgcctaaagt 180 

gaaaatcttc tctagctctt ttgactgtaa atttttgact cttgtaaaac atccaaattc 240 

atttttcttg tctttaaaat tatctaatct ttccattttt tccctattcc aagtcaattt 300 

gcttctctag cctcatttcc tagctcttat ctactattag taagtggctt ttttcctaaa 360 

agggaaaaca ggaagagana atggcacaca aaacaaacat tttatattca tatttctacc 420 

tacgttaata aaatagcatt ttgtgaagcc agctcaaaag aaggcttaga tccttttatg 480 

tccattttag tcactaaacg atatcnaaag tgccagaatg caaaaggttt gtgaacattt 540 

attcaaaagc taatataaga tatttcacat actcatcttt ctg 583 



<210> 204 
<211> 589 
<212> DMA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) ... (589) 
<223> n = A,T,C or G 



<400> 204 

ttttttttnt tttttttttt ttttttnctc ttcttttttt ttganaatga ggatcgagtt 60 

tltcactctc tagatagggc atgaagaaaa ctcatctttc cagctttaaa ataacaatca 120 

aatctcttat gctatatcat attttaagtt aaactaatga gtcactggct tatcttctcc 180 

tyaaggaaat ctgttcattc ttctcattca tatagttata tcaagtacta ccttgcatat 240 

tgagaggttt ttcttctcta tttacacata tatttccatg tgaatttgta tcaaaccttt 300 

attttcatgc aaactagaaa ataatgtntt cttttgcata agagaagaga acaatatnag 360 

cattacaaaa ctgctcaaat tgtttgttaa gnttatccat tataattagt tnggcaggag 420 

ctaatacaaa tcacatttac ngacnagcaa taataaaact gaagtaccag ttaaatatcc 480 

aaaataatta aaggaacatt tttagcctgg gtataattag ctaattcact ttacaagcat 540 

ttattnagaa tgaattcaca tgttattatt ccntagccca acacaatgg 589 



<210> 205 

<211> 545 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (545) 
<223> n = A, T, C or G 

<400> 205 

Lttttntttt ttttttcagt aataatcaga acaatattta 

aqaaaagtgc cttacattta ataaaagttt gtttctcaaa 

tngtcttgaa caccaatatt aatttgagga aaatacacca 

rtaagatcat agagcttgta agtgaaaaga taaaatttga 

aaaaatccac tattagcaaa taaattacta tggacttctt 

atggggtgtc actggtaaac caacacattc tgaaggatac 



tttttatatt taaaattcat 60 

gtgatcagag gaattagata 120 

aaatacatta agtaaattat 180 

cctcagaaac tctgagcatt 240 

gctttaattt tgtgatgaat 300 

attacttagt gatagattct 360 
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tatgtacttt gctanatnac gtggatatga gttgacaagt ttctctttct tcaatctttt 420 

aaggggcnga ngaaatgagg aagaaaagaa aaggattacg catactgttc tttctatngg 480 

aaggattaga tatgtttcct ttgccaatat taaaaaaata ataatgttta ctactagtga 540 

aaccc 545 



<210> 206 
<211> 487 
<212> DNA 
<213> Homo sapien 

<220> 

<221> inisc_f eature 
<222>. (1) . . . (487) 
<223> n = A, T,C or G 



<400> 206 

tttttttttt ttttttagtc aagtttctna tttttattat aattaaagtc ttggtcattt 60 

catttattag ctctgcaact tacatattta aattaaagaa acgttnttag acaactgtna 120 

caatttataa atgtaaggtg ccattattga gtanatatat tcctccaaga gtggatgtgt 180 

cccttctccc accaactaat gaancagcaa cattagttta attttattag tagatnatac 240 

actgctgcaa acgctaattc tcttctccat ccccatgtng atattgtgta tatgtgtgag 300 

ttggtnagaa tgcatcanca atctnacaat caacagcaag atgaagctag gcntgggctt 360 

tcggtgaaaa tagactgtgt ctgtctgaat caaatgatct gacctatcct cggtggcaag 420 

aactcttcga accgcttcct caaaggcngc tgccacattt gtggcntctn ttgcacttgt 480 

ttcaaaa 487 



<210> 207 
<211> 332 
<212> DNA 
<213> Homo sapien 

<220> 

<^221> misc feature 
<222> (1) .7. (332) 
<223> n A,T,C or G 



<400> 207 

tgaattggct aaaagactgc atttttanaa ctagcaactc ttatttcttt cctttaaaaa 60 

tacatagcat taaatcccaa atcctattta aagacctgac agcttgagaa ggtcactact 120 

gcatttatag gaccttctgg tggttctgct gttacntttg aantctgaca atccttgana 180 

atctttgcat gcagaggagg taaaaggtat tggattttca cagaggaana acacagcgca 24 0 

gaaatgaagg ggccaggctt actgagcttg tccactggag ggctcatggg tgggacatgg 300 

aaaagaaggc agcctaggcc ctggggagcc ca 332 



<210> 208 
<211> 524 
<212> DNA 
<213> Homo sapien 

<220> 

<221> inisc_f eature 
<222> (1) . . . (524) 
<223> n - A, T,C or G 



<400> 208 

agggcgtggt gcggagggcg ttactgtttt gtctcagtaa caataaatac aaaaagactg 60 

gttgtgttcc ggccccatcc aaccacgaag ttgatttctc ttgtgtgcag agtgactgat 120 

tttaaaggac atggagcttg tcacaatgtc acaatgtcac agtgtgaagg gcacactcac 180 
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'.^ccc^cqzga ttcacartta gcaaccaaca aragctcatg agtccatact tgtaaatact 240 

rrtggcaqaa tactrntrga aacttgcaga tgataactaa gatccaagat atttcccaaa 300 

::;'\:iatagaa gtgggtcata a;;attaatta cctgntcaca tcagcttcca tttacaagtc 360 

^.^o cccag acactgacat caaactaagc ccacttagac tcctcaccac cagtctgtcc 420 

tdtcatcaga caggaggctg tcaccttgac caaattctca ccagtcaatc atctatccaa 480 

~ rj -.:ccatt ac ctgatccact tccggraatg caccaccttg gtga 524 



<210> 209 

<211> 159 

<212> DNA 

<213> Homo sapien 



<hOO> 209 

:K^atgaqTc^a atccagagtt gccatggaga aaattccagt gtcagcattc ttgctccttg 60 

^.ggccctcLc ctacactctg gccagagata ccacagtcaa acctggagcc aaaaaggaca 120 

caaaggacic tcgacccaaa ctgccccaga ccctctcca 159 



< ?.10 ~ ' 210 
<2:i> 256 
<212^ PNA 
<213^ Homo s dpi en 

<220> 

<221> misc_feature 
<222^ ( i ) . . . (256) 
<223> n » A,T,C or G 



<400> 210 

actccctggc aqacaaaggc agaggagaga gctctgttag ttctgtgttg ttgaactgcc 60 

actgaatttc tttccacttg gactattaca tgccanttga gggactaatg gaaaaacgta 120 

tggggagatt ttanccaatt tangtntgta aatggggaga ctggggcagg cgggagagat 180 

irtgcagggtg naaatgggan ggctggtttg ttanatgaac agggacatag gaggtaggca 240 

ccaggatgct aaatca 256 



<210> 211 
<211> 264 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (264) 
<223> n = A,T,C or G 



<.400> 211 

acattgtttt tttgagataa agcattgaga gagctctcct taacgtgaca caatggaagg 60 

actggaacac atacccacat crttgttctg agggataatt ttctgataaa gtcttgctgt 120 

atattcaagc acatatgtta tatattattc agttccatgt ttatagccta gttaaggaga 180 

ggggagatac attcngaaag aggactgaaa gaaatactca agtnggaaaa cagaaaaaga 240 

aaaaaaggag caaatgagaa gcct 264 



<210> 212 
<211> 328 
<212> DNA 
<213> Homo sapien 

^:;220> 

<221> misc feature 
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<222> (1) . . . (328) 
<223> n = A, T,C or G 

<400> 212 

acccaaaaat ccaatgctga atatttggct tcattattcc canattcttt gattgtcaaa 60 

ggatttaatg ttgtctcagc ttgggcactt cagttaggac ctaaggatgc cagccggcag 120 

gtttatatat gcagcaacaa tattcaagcg cgacaacagg ttattgaact tgcccgccag ISO 

ttnaatttca ttcccattga cttgggatcc ttatcatcag ccagagagat tgaaaattta 240 

cccctacnac tctttactct ctgganaggg ccagtggtgg tagctataag cttggccaca 300 

tttttttttc ctttattcct ttgtcaga 328 

<210> 213 

<211> 250 

<212> DNA 

<213> Homo sapien 

<220> 

<221> niisc_f eature 
<222> (1) . . . (250) 
<223> n = A,T,C or G 

<400> 213 

acttatgagc agagcgacat atccnagtgt agactgaata aaactgaatt ctctccagtt 60 

taaagcattg ctcactgaag ggatagaagt gactgccagg agggaaagta agccaaggct 120 

cattatgcca aagganatat acatttcaat tctccaaact tcttcctcat tccaagagtt 180 

ttcaatattt gcatgaacct gctgataanc catgttaana aacaaatatc tctctnacct 240 

tctcatcggt 250 

<210> 214 
<211> 444 
<212> DNA 
<213> Homo sapxGn 

<220> 

<221> inisc_f eature 
<222> (1) . . , (444) 
<223> n = A,T,C or G 

<400> 214 

acccagaatc caatgctgaa tatttggctt cattattccc agattctttg attgtcaaag 60 

gatttaatgt tgtctcagct tgggcacttc agttaggacc taaggatgcc agccggcagg 120 

tttatatatg cagcaacaat attcaagcgc gacaacaggt tattgaactt gcccgccagt 180 

tgaatttcat tcccattgac ttgggatcct tatcatcagc canagagatt gaaaatttac 240 

ccctacgact ctttactctc tggagagggc cagtggtggt agctataagc ttggccacat 300 

ttttttttcc tttattcctt tgtcagagat gcgattcatc catatgctan aaaccaacag 360 

agtgactttt acaaaattcc tataganatt gtgaataaaa ccttacctat agttgccatt 420 

actttgctct ccctaatata cctc 444 

<210> 215 
<211> 366 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc__f eature 
<222> (1) . . . (366) 
<223> n = A, T,C or G 
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<400> 215 



acttacgagc agaqcgacat atccaaqtgt anactgaata aaactgaatt ctctccagtt 

^aaagcattg ctcactgaag ggatagaagr gactgccagg agggaaagta agccaaggct 

cattatgcca aagganatat acatttcaat tctccaaact tcttcctcat tccaagagtt 

ttcaatattr gcatgaacct gctgataagc catgttgaga aacaaatatc tctctgacct 

t.crcatcggt aagcagaggc tgtaggcaac atggaccata gcgaanaaaa aactragtaa 

tccaagctgt tttctacact gtaaccaggt ttccaaccaa ggtggaaatc tcctatactt 
qa' qc c 



60 
120 
180 
240 
300 
360 
366 



<210> 216 

<211> 260 

<212> DNA 

<213> Homo sapien 

<220> 

<221> inisc_f eature 

<222> (1) . . . (260) 

<223> n = A, T, C or G 

<400> 216 

ctgtataaac agaactccac tgcangaggg agggccgggc caggagaatc tccgcttgtc 60 

caagacaggg gcctaaggag ggtctccaca ctgctnntaa gggctnttnc atttttttat 120 

taataaaaag tnnaaaaggc ctcttctcaa cttttttccc ttnggctgga aaatttaaaa 180 

atcaaaaatt tcctnaagtt ntcaagctat catatatact ntatcctgaa aaagcaacat 240 

aattcttcct tccctccttt 260 

<210> 217 

<211> 262 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . , (262) 
<223> n - A, T, C or G 

<400> 217 

acciacgtgg gtaagtttan aaatgttata atttcaggaa naggaacgca tataattgta 60 

tcttgcctat aattttctat tttaataagg aaatagcaaa ttggggtggg gggaatgtag 120 

ggcattctac agtttgagca aaatgcaatt aaatgtggaa ggacagcact gaaaaatttt 180 

atgaataatc tgtatgatta tatgtctcta gagtagattt ataattagcc acttacccta 240 

atatccttca tgcttgtaaa gt 262 

<210> 218 
<211> 205 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) ... (205) 
<223> n = A, C or G 

<400> 218 

accaaggtgg tgcattaccg gaantggatc aangacacca tcgtggccaa cccctgagca 60 

ct;cc-T ar.caa ctcccttttg tagtaaactt ggaaccttgg aaatgaccag gccaagactc 120 

-jg^iccrcccc agttctactg acctttgtcc ttangtntna ngtccaqggt tgctaggaaa 180 
ariaaatcagc agacacaggt gtaaa 205 
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<210> 219 

<211> 114 

<212> DNA 

<213> Homo sapien 

<40C> 219 

tactgttttg tctcagtaac aataaataca aaaagactgg ttgtgttccg gccccatcca 60 
accacgaagt tgatttctct tgtgtgcaga gtgactgatt ttaaaggaca tgga 114 

<210> 220 

<211> 93 

<212> DNA 

<213> Hono sapien 

<400> 220 

actagccagc acaaaaqgca gggtagcctg aattgctttc tgctctttac atttctttta 60 
aaataagcat ttagtgctca gtccctactg agt 93 

<210> 221 
<211> 167 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) ... (167) 
<223> n = A,T,C or G 



<400> 221 

actangtgca ggtgcgcaca aatatttgtc gatattccct tcatcttgga ttccatgagg 60 

tcttttgccc agcctgtggc tctactgtag taagtttctg ctgatgagga gccagnatgc 120 

cccccactac cttccctgac gctccccana aatcacccaa ccucfcgL 167 



<210> 222 
<211> 351 
<212> DNA 
<213> Homo sapien 



<400> 222 

^gggcgtggt gcggagggcg gtactgacct cattagtagg aggatgcatt ctggcacccc 60 

gttcttcacc tgtcccccaa tccttaaaag gccatactgc ataaagtcaa caacagataa 120 

atgtttgctg aattaaagga tggatgaaaa aaattaataa tgaatttttg cataatccaa 180 

ttttctcttt tatatttcta gaagaagttt ctttgagcct attagatccc gggaatcttt 240 

taggtgagca tgattagaga gcttgtaggt tgcttttaca tatatctggc atatttgagt 300 

ctcgtatcaa aacaatagat tggtaaaggt ggtattattg tattgataag t 351 



<210> 223 
<211> 383 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (383) 
<223> n = A, T,C or G 

<400> 223 
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M^jiaacaaaca aacaaaaaaa acaattcttc attcagaaaa attatcttag ggactgatat 60 

tggt:aat_taT: ggtcaatcta atwrtrttkt ggggcatttc cttacattgt cttgacaaga 120 

\ z i.r.ciazgtc tgtgccaaaa "iitt rgtatrt tatttgoaga cttcttatca aaagtaatgc 180 

^ gcccjaagga agtctaagga attactagtg ttcccmtcac ttgtttggag tgtgctattc 240 

taaaagattt tgatttcctg gaatgacaat tatattttaa ctttggtggg ggaaanagtt 300 

^taggaccac agtcttcact tctgatactt gtaaattaat cttttattgc acttgttttg 360 

.i.^^Httaagc tatatgtrta aaa 333 



<210> 224 
<211> 320 
<212> DNA 
<213> Homo sapien 



<4G0> 224 

cccctgaagg cttcttgtta gaaaatagta cagttacaac caataggaac aacaaaaaga 60 

aaaagrttgt gacattgtag tagggagtgt gtacccctta ctccccatca aaaaaaaaat 120 

ggatacatgg ttaaaggata raagggcaat attttatcat atgttctaaa agagaaggaa 180 

gagaaaatac tactttctcr aaatggaagc ccttaaaggt gctttgatac tgaaggacac 240 

a.::iatgtggcc gtccatcctc ctttaragtt gcatgacttg gacacggtaa ctgttgcagt 300 

tttaractcm gcattgtgac 309 



<210> 225 

<211> 1214 

<212> DNA 

<213> Homo sapien 



<400> 225 

gaggactgca gcccgcactc gcagccctgg caggcggcac tggtcatgga aaacgaattg 60 

ttctgctcgg gcgtcctggt gcatccgcag tgggtgctgt cagccgcaca ctgtttccag 120 

aactcctaca ccatcgggct gggcctgcac agtcttgagg ccgaccaaga gccagggagc 180 

cagatggtgg aggccagcct ctccgtacgg cacccagagt acaacagacc cttgctcgct 240 

aacgacctca tgctcatcaa gttggacgaa tccgtgtccg agtctgacac catccggagc 300 

atcagcattg cttcgcagtg ccctaccgcg gggaactctt gcctcgtttc tggctggggt 360 

:-tgctggcga acggcagaat gcctaccgtg ctgcagtgcg tgaacgtgtc ggtggtgtct 420 

gaggaggtct gcagtaagct ctatgacccg ctgtaccacc ccagcatgtt ctgcgccggc 480 

ggagggcaag accagaagga ctcctgcaac ggtgactctg gggggcccct gatctgcaac 540 

qggtacttgc agggccttgt gtctttcgga aaagccccgt gtggccaagt tggcgtgcca 600 

Qgtgtctaca ccaacctctg caaattcact gagtggatag agaaaaccgt ccaggccagt 660 

taactctggg gactgggaac ccatgaaatt gacccccaaa bacatcctgc ggaaggaatt 720 

caggaatatc tgttcccagc ccctcctccc tcaggcccag gagtccaggc ccccagcccc 780 

tcctccctca aaccaagggt acagatcccc agcccctcct ccctcagacc caggagtcca 840 

gaccccccag cccctcctcc ctcagaccca ggagtccagc ccctcctccc tcagacccag 900 

gagtccagac cccccagccc ctcctccctc agacccaggg gtccaggccc ccaacccctc 960 

ctccctcaga ctcagaggtc caagccccca acccctcctt ccccagaccc agaggtccag 1020 

■:;tGCcagccc ctcctccctc agacccagcg gtccaatgcc acctagactc tccctgtaca 1080 

cagtgccccc ttgtggcacg ttgacccaac cttaccagtt ggtttttcat tttttgtccc 1140 

tttcccctag atccagaaat aaagtctaag agaagcgcaa aaaaaaaaaa aaaaaaaaaa 1200 

aaaaaaaaaa aaaa l'^14 



<210> 226 
<211> 119 
<212> DNA 
<213> Homo sapien 

<400> 226 

:i-uM:agtatg tgcagggaga cggaacccca tgtgacagcc cactccacca gggttcccaa 60 
aociacclggc ccagtcataa tcattcatcc tgacagtggc aataatcacg ataaccagt 119 
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<210> 227 
<211> 818 
<212> DNA 
<213> Homo sapien 



<400> 227 
acaattcata gggacgacca 
tttttgctac atatggggtc 
acggacggtt cttagcacaa 
aattttcctc ctctggagga 
gagaaagcca cgctcggcct 
gcttgtcccc ttccaatcag 
agggcctcct caggagcagt 
ggaaagggtg caccctcagc 
acctgctggc tgtcttggga 
gccatccact ggacatgaag 
gacaggctct gccctcaagc 
aaagccattc ccacaaatcc 
caagaggata tgaggactgt 
gtccacttct aggttttcag 



atgaggacag 
ccttttcatt 
tttgtgaaat 
aaggtggtga 
tctctgaacc 
ccacttctga 
ccaagagttt 
agagaagccg 
tgcgcccagc 
ctgaggacac 
cggctgaggg 
agaccatacc 
ctcagcctgg 
cctagatggg 



ggaatgaacc 
ctttgcaaaa 
ctgtgtaraa 
ttgacaggca 
aggatggaac 
gaacccccat 
tcaaagataa 
agagcttaac 
ctttgagagg 
tgggcttcaa 
cagcaaccac 
atgaagcaac 
ctttgggctg 
agtcgtgt 



cggctctccc 
acactgggtt 
ccgggctttg 
gggagacagt 
ggcagacccc 
ctaacttcct 
cgtgacaact 
tctggtcgtt 
ccactacccc 
cactgagttg 
tctcctcccc 
gagacccaaa 
acaccatgca 



ccagccctga 
ttctgagaac 
caggggagat 
gacaaggcta 
tgaaaacgaa 
actggaaaag 
accatctaga 
tccagagaca 
atgaacttct 
tcatgagagg 
tttctcacgc 
cagtttggct 
cacacacaag 



<210> 228 
<211> 744 
<212> DNA 
<213> Homo sapien 



<400 
actggagaca 
gtcatgacgt 
tcgtggccga 
taggaaaagt 
tgctcggtgc 

gctggcagct 
gagaaggcta 
ccagacggtg 
ccgtggtatg 
ttcttttcgt 
ttgggagatg 
cttcactctg 



> 228 
ctgttgaact 
ttgacatacc 
cctggcctct 
ggcttcgtaa 
acattggggt 
tt 

gaatggcttg 
ggatgcttgt 
ttggccactc 
ccttggccca 
taatgttcct 
tggaccagag 
aagtagctgg 



tgatcaagac 
tttggaacga 
cctggcctgt 
aatagaagag 
gctttgggat 
gt tccaCt,ga 
ccggtggctc 
ctagtgttct 
ccttctaaaa 
ttccagcagt 
ctgtgttgtc 
atccactcct 
tggt 



ccagaccacc 
gcctcctcct 
ttcttaagat 
cagtcactgt 
aaaagattta 
agcL Lut ccc 
tgtggcaaga 
tagctgtcac 
cacaggcgcc 
cccagttatg 
agctgtcttc 
taagaaccag 



ccaggtctcc 
tggaagatgg 
gcggagt cac 
ggaactacca 
tgagccaact 
acagcagt cc 
t cacactgag 
gttggctcct 
ctcctggtga 
catttcaagt 
atttcctggg 
tggcgaaaga 



ttcgtgggat 
aagaccgtgt 
atttcaatgg 
aatggcgaga 
attctctggc 
acctctgcag 
atcgatgggt 
tccaggttgg 
cagtgacccg 
ttggggtttg 
ctaagcagca 
cactttcttt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
818 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
744 



<210> 229 
<211> 300 
<212> DNA 
<213> Homo sapien 

<400> 229 

cgagtctggg ttttgtctat aaagtttgat ccctcctttt ctcatccaaa tcatgtgaac 60 

cattacacat cgaaataaaa gaaaggtggc agacttgccc aacgccaggc tgacatgtgc 120 

tgcagggttg ttgtttttta attattattg ttagaaacgt cacccacagt ccctgttaat 180 

ttgtatgtga cagccaactc tgagaaggtc ctatttttcc acctgcagag gatccagtct 240 

cactaggctc ctccttgccc tcacactgga gtctccgcca gtgtgggtgc ccactgacat 300 

<210> 230 

<211> 301 

<212> DNA 

<213> Homo sapien 
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<400> 230 



cagcagaaca aatacaaata tgaagagtcc aaagatctca taaaatctat gctgaggaat 

aagcgacagt tcaaggagga gaagcttgca gagcagctca agcaagctga ggagctcagg 

caa-Lataaaq ucctggttca cactcaggaa cgagagctga cccagttaag ggagaagttg 

cgqgaaqgqa gagatgcctc cctctcatrg aatgagcatc tccaggccct cctcactccg 

■jatqaaccgg acaagtccca ggggcaggac ctccaagaaa cagacctcgg ccgcgaccac 
Q 



60 

180 
240 
300 
301 



<210> 231 
<211> 301 
<212> DNA 
<213> Homo sapien 

<400> 231 

qcaagcacgc tggcaaatct ctgtcaggtc agctccagag aagccattag tcattttagc 60 

caggaactcc aagtccacat ccttggcaac tggggacttg cgcaggttag ccttgaggat 120 

ggcaacacgg gacttctcat caggaagtgg gatgtagatg agctgatcaa gacggccagg 180 

tctgaggatg gcaggatcaa tgatgtcagg ccggttggta ccgccaatga tgaacacatt 240 

tttttttgtg gacatgccat ccatttctgt caggatctgg ttgatgactc ggtcagcagc 300 

c 301 

<210> 232 
<211> 301 
<212> DNA 
<213> Homo sapien 

<400> 232 

agtaggtatt tcgtgagaag ttcaacacca aaactggaac atagttctcc ttcaagtgtt 60 

ggcgacagcg gggcttcctg attctggaat ataactttgt gtaaattaac agccacctat 120 

agaagagtcc atctgctgtg aaggagagac agagaactct gggttccgtc gtcctgtcca 180 

cgtgctgtac caagtgctgg tgccagcctg ttacctgttc tcactgaaaa tctggctaat 240 

gctcttgtgt atcacttctg attctgacaa tcaatcaatc aatggcctag agcactgact 300 

9 301 

<210> 233 
<211> 301 
<212> DNA 
<213> Homo sapxen 

<400> 233 

atgactgact tcccagtaag gctctctaag gggtaagtag gaggatccac aggatttgag 60 

atgctaaggc cccagagatc gtttgatcca accctcttat tttcagaggg gaaaatgggg 120 

cctagaagtt acagagcatc tagctggtgc gctggcaccc ctggcctcac acagactccc 180 

gagtagctgg gactacaggc acacagtcac tgaagcaggc cctgttagca attctatgcg 240 

"cacaaattaa catgagatga gtagagactt tattgagaaa gcaagagaaa atcctatcaa 300 

c 301 

<210> 234 

<211> 301 

<212> DNA 

<213> Homo sapien 

<400> 234 

ciggtcctaca catcgagact catccatgat tgatatgaat ttaaaaatta caagcaaaga 60 

cattttattc atcatgatgc tttcttttgt ttcttctttt cgttttcttc tttttctttt 120 

tcaatttcag caacatactt ctcaatttct tcaggattta aaatcttgag ggattgatct 180 

cgrctcatga cagcaagttc aatgtttttg ccacctgact qaaccacttc caggagtgcc 240 

rtgatcacca gcttaatggt cagatcatct gcttcaatgg cttcgtcagt atagttcttc 300 



wo 01/51633 



PCT/llSOl/01574 



81 



t 301 

<210> 235 
<211> 283 
<212> DNA 
<213> Homo sapien 

<400> 235 

tggggctgtg catcaggcgg gtttgagaaa tattcaattc tcagcagaag ccagaatttg 60 

aattccctca tcttttaggg aatcatttac caggtttgga gaggattcag acagctcagg 120 

tgctttcact aatgtctctg aacttctgtc cctctttgtt catggatagt ccaataaata 180 

atgttatctt tgaactgatg ctcataggag agaatataag aactctgagt gatatcaaca 240 

ttagggattc aaagaaatat tagatttaag ctcacactgg tea 283 

<210> 236 
<211> 301 
<212> DNA 
<213> Homo sapien 

<400> 236 

aggtcctcca ccaactgcct gaagcacggt taaaattggg aagaagtata gtgcagcata 60 

aatactttta aatcgatcag atttccctaa cccacatgca atcttcttca ccagaagagg 120 

tcggagcagc atcattaata ccaagcagaa tgcgtaatag ataaatacaa tggtatatag 180 

tgggtagacg gcttcatgag tacagtgtac tgtggtatcg taatctggac ttgggttgta 240 

aagcatcgtg taccagtcag aaagcatcaa tactcgacat gaacgaatat aaagaacacc 300 

3 301 

<210> 237 

<211> 301 

<212> DNA 

<213> Homo sapien 

< 4 0 C > 257 

cagtggtagt ggtggtggac gtggcgttgg tcgtggtgcc ttttttggtg cccgtcacaa 60 

actcaatttt tgttcgctcc tttttggcct tttccaattt gtccatctca attttctggg 120 

ccttggctaa tgcctcatag taggagtcct cagaccagcc atggggatca aacatatcct 180 

ttgggtagtt ggtgccaagc tcgtcaatgg cacagaatgg atcagcttct cgtaaatcta 240 

gggttccgaa attctttctt cctttggata atgtagttca tatccattcc ctcctttatc 300 

t 301 

<210> 238 
<211> 301 
<212> DNA 
<213> Homo sapien 

<400> 238 

gggcaggttt tttttttttt ttttttgatg gtgcagaccc ttgctttatt tgtctgactt 60 

gttcacagtt cagccccctg ctcagaaaac caacgggcca gctaaggaga ggaggaggca 120 

Gcttgagact tccggagtcg aggctctcca gggttcccca gcccatcaat cattttctgc 180 

accccctgcc tgggaagcag ctccctgggg ggtgggaatg ggtgactaga agggatttca 240 

gtgtgggacc cagggtctgt tcttcacagt aggaggtgga agggatgact aatttcttta 300 

t 301 

<210> 239 
<211> 239 
<212> DNA 
<213> Homo sapien 
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<400> 239 

ataagcagct agggaattct ttatttagta atgiicctaac ataaaagttc acataactgc 60 

^tcrgtcaaa ccatgatact gagctttgrg acaacccaga aataactaag agaaggcaaa 120 

-rataatacct tagagatcaa gaaacattta cacagttcaa ctgtttaaaa atagctcaac 180 

attcagccag tgagtagagt gtgaatgcca gcatacacag tatacaggtc cttcaggga 239 

<210> 240 
<2il> 300 
<212> DNA 
<213> Homo sapien 

<400> 240 

ggtcctaatg aagcagcagc ttccacattt taacgcaggt ttacggtgat actgtccttt 60 

gggatctgcc ctccagtgga accttttaag gaagaagtgg gcccaagcta agttccacat 120 

gctgggtgag ccagatgact tctgttccct ggtcactttc ttcaatgggg cgaatggggg 180 

ctgccaggtt tttaaaatca tgcttcatct tgaagcacac ggtcacttca ccctcctcac 240 

gctgtgggtg tactttgatg aaaataccca ctttgttggc ctttctgaag ctataatgtc 300 

<210> 241 
<211> 301 
<212> DNA 
<213> Homo sapien 

<400> 241 

gaggtctggt gctgaggtct ctgggctagg aagaggagtt ctgtggagct ggaagccaga 60 

cctctttgga ggaaactcca gcagctatgt tggtgtctct gagggaatgc aacaaggctg 120 

ctcctccatg tattggaaaa ctgcaaactg gactcaactg gaaggaagtg ctgctgccag 180 

tgtgaagaac cagcctgagg tgacagaaac ggaagcaaac aggaacagcc agtcttttct 240 

tcctcctcct gtcatacggt ctctctcaag catcctttgt tgtcaggggc ctaaaaggga 300 

g 301 

<210> 242 

<211> 301 

<212> BNA 

<213> Homo sapien 

<400> 242 

ccgaggtcct gggatgcaac caatcactct gtttcacgtg acttttatca ccatacaatt 60 

tgtggcattt cctcattttc tacattgtag aatcaagagt gtaaataaat gtatatcgat 120 

gtcttcaaga atatatcatt cctttttcac tagaacccat tcaaaatata agtcaagaat 180 

cttaatatca acaaatatat caagcaaact ggaaggcaga ataactacca taatttagta 240 

taagtaccca aagttttata aatcaaaagc cctaatgata accattttta gaattcaatc 300 

a 301 

<210> 243 
<211> 301 
<212> DNA 
<213> Homo sapien 



<400> 243 

aggtaagtcc cagtttgaag ctcaaaagat ctggtatgag cataggctca tcgacgacat 60 

ggtggcccaa gctatgaaat cagagggagg cttcatctgg gcctgtaaaa actatgatgg 120 

tgacgtgcag tcggactctg tggcccaagg gtatggctct ctcggcatga tgaccagcgt 180 

gctggtttgt ccagatggca agacagtaga agcagaggct gcccacggga ctgtaacccg 240 

tcactaccgc atgttccaga aaggacagga gacgtccacc aatcccattg cttccatttt 300 

^ 301 



<210> 244 
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<2ll> 300 
<212> DNA 
<213> Homo sapien 

<400> 244 

gctggtttgc aagaatgaaa tgaatgattc tacagctagg acttaacctt gaaatggaaa 60 

gtcatgcaat cccatttgca ggatctgtct gtgcacatgc ctctgtagag agcagcattc 120 

ccagggacct tggaaacagt tgacactgta aggtgcttgc tccccaagac acatcctaaa 180 

aggtgttgta atggtgaaaa cgtcttcctt ctttattgcc ccttcttatt tatgtgaaca 240 

actgtttgtc ttttgtgtat cttttttaaa ctgtaaagtt caattgtgaa aatgaatatc 300 

<210> 245 
<211> 301 
<212> DNA 
<213> Homo sapien 

<400> 245 

gtctgagtat ttaaaatgtt attgaaatta tccccaacca atgttagaaa agaaagaggt 60 

tatatactta gataaaaaat gaggtgaatt actatccatt gaaatcatgc tcttagaatt 120 

aaggccagga gatattgtca ttaatgtara cttcaggaca ctagagtata gcagccctat 180 

gttttcaaag agcagagatg caattaaata ttgtttagca tcaaaaaggc cactcBatac 240 

agctaataaa atgaaagacc taatttctaa agcaattctt tataatttac aaagttttaa 300 

g 301 

<210> 246 
<211> 301 
<212> DNA 
<213> Homo sapien 



<400> 246 

ggtctgtcct acaatgcctg cttcttgaaa gaagtcggca ctttctagaa tagctaaata 60 

acctgggctt attttaaaga actatttgta gctcagattg gttttcctat ggctaaaata 120 

agtgcttctt gtyaaaaLta aataaaaoag ttaattcaaa gccttgatat atgrtaccac 180 

taacaatcat actaaatata ttttgaagta caaagtttga catgctctaa agtgacaacc 240 

caaatgtgtc ttacaaaaca cgttcctaac aaggtatgct ttacactacc aatgcagaaa 300 

301 



<210> 247 
<211> 301 
<212> DNA 
<213> Homo sapien 



<400> 247 

aggtcctttg gcagggctca tggatcagag ctcaaactgg agggaaaggc atttcgggta 60 

gcctaagagg gcgactggcg gcagcacaac caaggaaggc aaggttgttt cccccacgct 120 

gtgtcctgtg ttcaggtgcg acacacaatc ctcatgggaa caggatcacc catgcgctgc 180 

ccttgatgat caaggttggg gcttaagtgg attaagggag gcaagttctg ggttccttgc 240 

cttttcaaac catgaagtca ggctctgtat ccctcctttt cctaactgat attctaacta 300 

^ 301 



<210> 248 

<211> 301 

<2I2> DNA 

<213> Homo sapien 

<400> 248 

aggtccttgg agatgccatt tcagccgaag gactcttctw ttcggaagta caccctcact 60 
attaggaaga ttcttagggg taatttttct gaggaaggag aactagccaa cttaagaatt 120 
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d::iggaagaa agtggttrgq aagacagcca aagaaataaa agcagattaa attgtatcag 180 
gt.acattcca gcctgttggc aactccataa aaacatttca gattttaatc ccgaatttag 240 
cta-taagac rggartrt t g trtttratgt rgtgtgtcgc agagctaaaa actcagttcc 300 

301 



<210> 249 
<211> 301 
v212> DNA 
<213> Homo sapien 



<400> 249 

gtccagagga agcacctggt gctgaactag gcttgccctg ctgtgaactt gcacttggag 60 

ccctgacgct gctgttctcc ccgaaaaacc cgaccgacct ccgcgatctc cgtcccgccc 120 

ccaqggagac acagcagtga ctcagagctg gtcgcacact gtgcctccct cctcaccgcc 180 

cdtcgtaatg aattattttg aaaattaatt ccaecatcct ttcagattct ggatggaaag 240 

actgaatctt tgacrcagaa ttgtttgctg aaaagaatga tgtgactttc ttagtcattt 300 

3 301 



<210> 250 
<211> 301 
<212> DNA 
<213> Homo sapien 



<400> 250 

qgtctgtgac aaggacttgc aggctgtggg aggcaagtga cccttaacac tacacttctc 60 

cttatcttta ttggcttgat aaacataatt atttctaaca ctagcttatt tccagttgcc 120 

cataagcaca tcagtacttt tctctggctg gaatagtaaa ctaaagtatg gtacatctac 180 

ctaaaagact actatgtgga ataatacata ctaatgaagt attacatgat ttaaagacta 240 

caataaaacc aaacatgctt ataacattaa gaaaaacaat aaagatacat gattgaaacc 300 

3 301 



<210> 251 
<211> 301 
<212> DNA 
<213> Homo sapien 



<400> 251 

gccgaggtcc tacatttggc ccagtttccc cctgcatcct ctccagggcc cctgcctcat 60 

aqacaacctc atagagcata ggagaactgg ttgccctggg ggcaggggga ctgtctggat 120 

ggcaggggtc ctcaaaaatg ccactgtcac tgccaggaaa tgcttctgag cagtacacct 180 

cattgggatc aatgaaaagc ttcaagaaat cttcaggctc actctcttga aggcccggaa 240 

cctctggagg ggggcagtgg aatcccagct ccaggacgga tcctgtcgaa aagatatcct 300 



301 

<210> 252 
<211> 301 
<212> DNA 
<213> Homo sapien 



<400> 252 

gcaaccaatc actctgtttc acgtgacttt tatcaccata caatttgtgg catttcctca 60 

t-tttctacat tgtagaatca agagtgtaaa taaatgtata tcgatgtctt caagaatata 120 

Lo.j- vccttt ttcactagga acccattcaa aatataagtc aagaatctta atatcaacaa 180 

atatdtcaag caaactggaa ggcagaataa ctaccataat ttagtataag tacccaaagt 240 

Ittataaatc aaaagcccta atgataacca tttttagaat tcaatcatca ctgtagaatc 300 

^ 301 



<210> 253 
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<211> 301 
<212> DNA 
<213> Homo sapien 



<400> 253 

ttccctaaga agatgttatt ttgttgggtt 
caactaaaaa aaaaaaataa agaaaaaatg 
tggtctgatt gttttcagac cttaaaatat 
gatttttttt cttagagaac cacaaaacat 
tccatagtgc ccacagggta ttcctcacat 

g 



ttgttccccc tccatctcga ttctcgtacc 60 
tgctgcgttc tgaaaaataa ctccttagct 120 
aaacttgttt cacaagcttt aatccatgtg 180 
aaaaggagca agtcggactg aatacctgtt 240 
tttctccata ggaaaatgct ttttcccaag 300 

301 



<210> 254 
<211> 301 
<212> DNA 
<213> Homo sapien 



<400> 254 
cgctgcgcct ttcccttggg ggaggggcaa 
aacttgacca attcccttga agcgggtggg 
ccaaatctct tcatcttacc ctggtggact 
gaaaaaaata aagctttgga cttttcaagg 
acttaaactg agccaggaaa agctgcagat 
t 



ggccagaggg ggtccaagtg cagcacgagg 60 
ttaaaccctg taaatgggaa caaaatcccc 120 
cctgactgta gaattttttg gttgaaacaa 180 
ttgcttaaca ggtactgaaa gactggcctc 240 
ttattaatgg gtgtgttagt gtgcagtgcc 300 

301 



<210> 255 
<211> 302 
<212> DNA 
<213> Homo sapien 



<400> 255 

agcttttttt tttttttttt tttttttttt ttcattaaaa aatagtgctc tttattataa 60 

attactgaaa tgtttcttrr ctgaatataa atataaatat gtgcaaagtt tgacttggat 120 

tgggattttg ttgagttctt caagcatctc ctaataccct caagggcctg agtagggggg 18 0 

aggaaaaagg actggaggtg gaatctttat aaaaaacaag agtgattgag gcagattgta 24 0 

aacattatta aaaaacaaga aacaaacaaa aaaatagaga aaaaaaccac cccaacacac 300 

as 302 



<210> 256 
<211> 301 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . {301) 
<223> n = A,T,C or G 



<400> 256 

gttccagaaa acattgaagg tggcttccca aagtctaact agggataccc cctctagcct 60 

aggaccctcc tccccacacc tcaatccacc aaaccatcca taatgcaccc agataggccc 120 

acccccaaaa gcctggacac cttgagcaca cagttatgac caggacagac tcatctctat 180 

aggcaaatag ctgctggcaa actggcatta cctggtttgt ggggatgggg gggcaagtgt 24 0 

gtggcctctc ggcctggtta gcaagaacat tcagggtagg cctaagttan tcgtgttagt 300 

t 301 



<210> 257 
<211> 301 
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<212> DNA 

<213> Homo sapien 



<400> 257 

qttqtqgsgg aactctggcr tgctcattaa gtcctactga ttttcactat cccctgaatt 60 

tccccactta rttttgtctt tcacratcgc aggccttaga agaggtctac crgcctccag 120 

^ ar-ctag tccagtctac cccctggagt tagaatggcc atcctgaagt gaaaagtaat 180 

gtcacattac tcccttcagt gatttcttgt agaagtgcca atccctgaat gccaccaaga 240 

LOT Tc^atctt cacatcttta atcttatctc tttgactcct ctttacaccg gagaaggctc 300 



301 

<210> 258 
<211> 301 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (301) 
<223> n = A, T, C or G 



<400> 258 

cagcagtagt agatgccgta tgccagcacg cccagcactc ccaggatcag caccagcacc 60 

^ggggcjccag ccaccaggcg cagaagcaag ataaacagta ggctcaagac cagagccacc 120 

cccagggcaa caagaatcca ataccaggac tgggcaaaat cttcaaagat cttaacactg 180 

atgtctcggg cattgaggct gtcaataana cgctgatccc ctgctgtatg gtggtgtcat 240 

tggtgatccc tgggagcgcc ggtggagtaa cgttggtcca tggaaagcag cgcccacaac 300 

^ 301 



<210> 259 
<211> 301 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) , . . (301) 
<223> n = A,T,C or G 



<400> 259 

tcatatatgc aaacaaatgc agactangcc tcaggcagag actaaaggac atctcttggg 60 

gtgtcctgaa gtgatttgga cccctgaggg cagacaccta agtaggaatc ccagtgggaa 120 

gcaaagccat aaggaagccc aggattcctt gtgatcagga agtgggccag gaaggtctgt 180 

tccagctcac atctcatctg catgcagcac ggaccggatg cgcccactgg gtcttggctt 240 

ccctcccatc ttctcaagca gtgtccttgt tgagccattt gcatccttgg ctccaggtgg 300 

^ 301 



<210> 260 
<211> 301 
<212> DNA 
<213> Homo sapien 



<400> 260 

ttttttttct ccctaaggaa aaagaaggaa caagtctcat aaaaccaaat aagcaatggt 60 

^'J^ggtgtctt aacttgaaaa agattaggag tcactggttt acaagttata attgaatgaa 120 

-igdaclgtaa cagccacagt tggccatttc atgccaatgg cagcaaacaa caggattaac 180 

-^ygQca.3a3 taaataagtg tgtggaagcc ctgataagtg cttaataaac agactgattc 240 

actgagacat cagtacctgc ccgggcggcc gctcgagccg aattctgcag atatccatca 300 
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301 

<210> 261 
<211> 301 
<212> DNA 
<213> Homo sapien 



<400> 261 

aaatattcga gcaaatcctg taactaatgt gtctccataa aaggctttga actcagtgaa 60 

tctgcttcca tccacgattc tagcaatgac ctctcggaca tcaaagctcc tcttaaggtt 120 

agcaccaact attccataca attcatcagc aggaaataaa ggctcttcag aaggttcaat 180 

ggtgacatcc aatttcttct gataatttag attcctcaca accttcctag ttaagtgaag 240 

ggcatgatga tcatccaaag cccagtggtc acttactcca gactttctgc aatgaagatc 300 

a 301 



<210> 262 
<211> 301 
<212> DNA 
<213> Homo sapien 



<400> 262 

gaggagagcc tgttacagca tttgtaagca cagaatactc caggagtatt tgtaattgtc 60 

tgtgagcttc ttgccgcaag tctctcagaa atttaaaaag atgcaaatcc ctgagtcacc 120 

cctagacttc ctaaaccaga tcctctgggg ctggaacctg gcactctgca tttgtaatga 180 

gggctttctg gtgcacacct aattttgtgc atctttgccc taaatcctgg attagtgccc 240 

catcattacc cccacattat aatgggatag attcagagca gatactctcc agcaaagaat 300 

c 301 



<210> 263 

<211> 301 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (301) 
<223> n = A, T,C or G 

<400> 263 

tttagcttgt ggtaaatgac tcacaaaact gattttaaaa tcaagttaat gtgaattttg 60 

aaaattacta cttaatccta attcacaata acaatggcat taaggtttga cttgagttgg 120 

ttcttagtat tatttatggt aaataggctc ttaccacttg caaataactg gccacatcat 180 

taatgactga cttcccagta aggctctcta aggggtaagt angaggatcc acaggatttg 24 0 

agatgctaag gccccagaga tcgtttgatc caaccctctt attttcagag gggaaaatgg 300 

g 301 

<210> 264 
<211> 301 
<212> DNA 
<213> Homo sapien 

<400> 264 

aaagacgtta aaccactcta ctaccacttg tggaactctc aaagggtaaa tgacaaascc 60 

aatgaatgac tctaaaaaca atatttacat ttaatggttt gtagacaata aaaaaacaag 120 

gtggatagat ctagaattgt aacattttaa gaaaaccata scatttgaca gatgagaaag 180 

ctcaattata gatgcaaagt tataactaaa ctactatagt agtaaagaaa tacatttcac 240 

acccttcata taaattcact atcttggctt gaggcactcc ataaaatgta tcacgtgcat 300 

a 301 



BNS[X)CfD <:WO 0151633A2 I > 



w o (n/51633 



88 



P( T/l S01/tn574 



<21G> 265 
<211> 301 
<212> DNA 
<213> Homo sapien 



<400> 265 

tgcccdaytt atgtgtaagt gtatccgcac ccagaggtaa aactacactg tcatctttgt 60 

cttrtrgtga cgcagtattt cttctctggg gagaagccgg gaagtcttct cctggctcta 120 

■-atartcttg gaagtctcta atcaactttt gttccatttg tttcatttct tcaggaggga 180 

i-.ttucagttt gtcaacatgt tctctaacaa cacttgccca tttctgtaaa gaatccaaag 240 

cagtccaagg ctttgacatg tcaacaacca gcataactag agtatccttc agagatacgg 300 

- 301 



<210> 266 
<211> 301 
<212> DNA 
<2I3> Homo sapien 



<400> 266 

taccgtctgc ccttcctccc atccaggcca tctgcgaatc tacatgggtc ctcctattcg 60 

acaccagatc actctttcct ctacccacag gcttgctatg agcaagagac acaacctcct 120 

ctcttctgtg ttccagcttc ttttcctgtt cttcccaccc cttaagttct attcctgggg 180 

atagagacac caatacccat aacctctctc ctaagcctcc ttataaccca gggtgcacag 240 

cacagactcc tgacaactgg taaggccaat gaactgggag ctcacagctg gctgtgcctg 300 



301 

<210> 267 
<211> 301 
<212> DNA 
<213> Homo sapien 



<400> 267 

aaagagcaca ggccagctca gcctgccctg gccatctaga ctcagcctgg ctccatgggg 60 

gttctcagtg ctgagtccat ccaggaaaag ctcacctaga ccttctgagg ctgaatcttc 120 

atcctcacag gcagcttctg agagcctgat attcctagcc ttgatggtct ggagtaaagc 180 

ctcattctga ttcctctcct tcttttcttt caagttggct ttcctcacat ccctctgttc 240 

aattcgcttc agcttgtctg ctttagccct catttccaga agcttcttct ctttggcatc 300 

■- 301 



<210> 268 
<211> 301 
<212> DNA 
<213> Homo sapien 



<400> 268 

aatgtctcac tcaactactt cccagcctac cgtggcctaa ttctgggagt tttcttctta 60 

gatcttggga gagctggttc ttctaaggag aaggaggaag gacagatgta actttggatc 120 

tcgaagagga agtctaatgg aagtaattag tcaacggtcc ttgtttagac tcttggaata 180 

tgctgggtgg ctcagtgagc ccttttggag aaagcaagta ttattcttaa ggagtaacca 240 

cttcccattg ttctactttc taccatcatc aattgtatat tatgtattct ttggagaact 300 



301 

<210> 269 
<211> 301 
<212> DNA 
<213> Homo sapien 
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<400> 269 

taacaatata cactagctat ctttttaact gtccatcatt agcaccaatg aagattcaat 60 

aaaattacct ttattcacac atctcaaaac aattctgcaa attcttagtg aagtttaact 120 

atagtcacag accttaaata ttcacattgt tttctatgtc tactgaaaat aagttcacta 180 

cttttctgga tattctttac aaaatcttat taaaattcct ggtattatca cccccaatta 240 

tacagtagca caaccacctt atgtagtttt tacatgatag ctctgtagaa gtttcacatc 300 

t 301 



<210> 270 

<211> 301 

<212> DNA 

<213> Homo sapien 



<^00> 27 0 

cattgaagu:; cttttgcgaa acatcagaac acaagtgctt ataaaattaa ttaagcctta 60 

cacaagaatct catattcctL ttatttctaa ggagttaaac atagatgtag ctgatgtgga 120 

gagcttgcto gtgcagtgca tattggataa cactattcat ggccgaattg atcaagtcaa 180 

ccaactccti qaactqgatc atcagaagaa gggtggtgca cgatatactg cactagataa 240 

tggaccaacc aactaaattc tctcaccagg ctgtatcagt aaactggctt aacagaaaac 300 

^ 301 



<210> 271 
<21i> 301 
<2I2> DNA 
<213> Homo sapien 

<220> 

<221> mi sc_f eature 
<222> (1) . . . (301) 
<223> n = A,T,C or G 



<400> 271 

aaaaggttct cataagatLa acaatttaaa taaatatttg atagaacatt ctttctcatt 60 

tttatagctc atctttaggg ttgatattca gttcatgctt cccttgctgt tcttgatcca 120 

gaattgcaat cacttcatca gcctgtattc gctccaattc tctataaagt gggtccaagg 180 

tgaaccacag agccacagca cacctctttc ccttggtgac tgccttcacc ccatganggt 240 

tctctcctcc agatganaac tgatcatgcg cccacatttt gggttttata gaagcagtca 300 

^ 301 



<210> 272 
<211> 301 
<212> DNA 
<213> Homo sapien 



<400> 272 

taaattgcta agccacagat aacaccaatc aaatggaaca aatcactgtc ttcaaatgtc 60 

ttatcagaaa accaaatgag cctggaatct tcataatacc taaacatgcc gtatttagga 120 

tccaataatt ccctcatgat gagcaagaaa aattctttgc gcacccctcc tgcatccaca 180 

gcatcttctc caacaaatat aaccttgagt ggcttcttgt aatctatgtt ctttgttttc 240 

ctaaggactt ccattgcatc tcctacaata ttttctctac gcaccactag aattaagcag 300 

g 301 



<210> 273 
<211> 301 
<212> DNA 
<213> Homo sapien 

<220> 



BNSDOCID <WO 01S1633A2 I 



w o (11/51633 



90 



PCT/l S01/(n574 



<221> misc_feature 
<222> (1) - (301; 
-:223> n = A,T,C or G 



<400> 273 

acatgtgtgt atgtgtatct ttgggaaaan aanaagacat cttgtttayt atttttttgg 60 

agagangctg ggacatggat aatcacwtaa tttgctayta tyactttaat ctgactygaa 120 

gaaccgticta aaaataaaat ttaccatgtc dtatattcct tatagtatgc ttatttcacc 180 

ttytttctgt ccagagagag tatcagtgac ananatttma gggtgaamac atgmattggt 240 

gggacutnty tttacngagm accctgcccg sgcgccctcg makcngantt ccgcsananc 300 

t 301 



<210> 274 
<211> 301 
<212> DNA 
<213> Homo sapien 

<220> 

<221> inisc_f eature 
<222> (1) - . . (301) 
<223> n - A,T,C or G 



<400> 274 

cttatatact ctttctcaga ggcaaaagag gagatgggta atgtagacaa ttctttgagg 60 

aacagtaaat gattattaga gagaangaat ggaccaagga gacagaaatt aacttgtaaa 120 

tgattctctt tggaatctga atgagatcaa gaggccagct ttagcttgtg gaaaagtcca 180 

tctaggtatg gttgcattct cgtcttcttt tctgcagtag ataatgaggt aaccgaaggc 240 

aattgtgctt cttttgataa gaagctttct tggtcatatc aggaaattcc aganaaagtc 300 

c 301 



<210> 275 
<211> 301 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (301) 
<223> n = A, T,C or G 



<400> 275 

tcggtgtcag cagcacgtgg cattgaacat tgcaatgtgg agcccaaacc acagaaaatg 60 

cjggtgaaatt ggccaacttt ctattaactt atgttggcaa ttttgccacc aacagtaagc 120 

tggcccttct aataaaagaa aattgaaagg tttctcacta aacggaatta agtagtggag 180 

LCaauagact cccaggcctc agcgtacctg cccgggcggc cgctcgaagc cgaattctgc 24 0 

agatatccat cacactggcg gncgctcgan catgcatcta gaaggnccaa ttcgccctat 300 

a 301 



<210> 276 
<211> 301 
<212> DNA 
<2 1 3> Homo sapien 



<400> 276 

tgtacacata ctcaataaat aaatgactgc attgtggtat tattactata ctgattatat 60 

r.t.atcatgtg acttctaatt agaaaatgta tccaaaagca aaacagcaga tatacaaaat 120 

ta{aagagaca gaagatagac attaacagat aaggcaactt atacattgag aatccaaatc 180 

caatacattt aaacatttgg gaaatgaggg ggacaaatgg aagccagatc aaatttgtgt 240 
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aaaactattc agtatgtttc ccttgcttca tatctgagaa ggctctcctt caatggggat 300 
9 301 

<210> 277 
<211> 301 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (301) 
<223> n - A,T,C or G 

<400> 277 

tttgttgatg tcagtatttt attacttgcg ttatgagtgc tcacctggga aattctaaag 60 

atacagagga cttggaggaa gcagagcaac tgaatttaat ttaaaagaag gaaaacattg 120 

gaatcatggc actcctgata ctttcccaaa tcaacactct caatgcccca ccctcgtcct 180 

caccatagtg gggagactaa agtggccacg gatttgcctt angtgtgcag tgcgttctga 240 

gttcnctgtc gattacatct gaccagtctc ctttttccga agtccntccg ttcaatcttg 300 

^ 301 

<210> 278 
<211> 301 
' <212> DNA 
<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (301) 
<223> n = A,T,C or G 

<400> 278 

taccactaca ctccagcctg ggcaacagag caagaccugt. cucaaagcat aadatggaati 60 

aacatatcaa atgaaacagg gaaaatgaag ctgacaattt atggaagcca gggcttgtca 120 

cagtctctac tgttattatg cattacctgg gaatttatat aagcccttaa taataatgcc 18 0 

aatgaacatc tcatgtgtgc tcacaatgtt ctggcactat tataagtgct tcacaggttt 240 

tatgtgttct tcgtaacttt atggantagg tactcggccg cgaacacgct aagccgaatt 300 

^ 301 

<210> 279 

<211> 301 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc__f eature 
<222> (1) . . . (301) 
<223> n = A,T,C or G 

<400> 279 

aaagcaggaa tgacaaagct tgcttttctg gtatgttcta ggtgtattgt gacttttact 60 

gttatattaa ttgccaatat aagtaaatat agattatata tgtatagtgt ttcacaaagc 120 

ttagaccttt accttccagc caccccacag tgcttgatat ttcagagtca gtcattggtt 18 0 

atacatgtgt agttccaaag cacataagct agaanaanaa atatttctag ggagcactac 240 

catctgtttt cacatgaaat gccacacaca tagaactcca acatcaattt cattgcacag 300 

^ 301 

<210> 280 
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<211> 301 
<212> DNA 
<213> Homo sapien 



<400> 280 

ggractggag ttttcctccc ctgtgaaaac gtaactactg ttgggagtga attgaggatg 

uaqaaaggtg gtggaaccaa attgtggtca atggaaatag gagaatatgg ttctcactct 

uoagaaaaaa acctaagati: agcccaggta gttgcctgta acttcagttt ttctgcctgg 

gtt:gatata gtttagggtt ggggttagat taagatctaa attacatcag gacaaagaga 

cacjactatta actccacagt taattaagga ggtatgttcc atgtttattt gttaaagcag 



60 
120 
180 
240 
300 
301 



<210> 281 
<211> 301 
<212> DNA 
<213> Homo sapien 



<400> 281 
aggtacaaga aggggaatgg gaaagagctg 
gccgagcaat ccaaatcctg aatgaagggg 
atgtggtagc aatggcttta tcgggttata 
tgtgtagcac actgcgatta cagctaaata 
tgacaagtga aacaggatct tacgatggag 



ctgctgtggc 


attgttcaac 


ttggatattc 


60 


catcttctga 


aaaaggagat 


ctgaatctca 


120 


cggatgagaa 


gaactccctt 


tggagagaaa 


180 


acccgtattt 


gtgtgtcatg 


tttgcatttc 


240 


ttttgtatga 


aaacaaagt t 


gcagtacctc 


300 








301 



<210> 282 
<211> 301 
<212> DNA 
<213> Homo sapien 



<400> 282 
c^agqtactac agaattaaaa tactgacaag 
tccaqaaccc aaaaattaag aaattcaaaa 
agcgcagaag caaagcccag gcagaaccat 
cgcagaagca aagcccaggc agaaccatgc 
cagaagcaaa gcccaggcag aacatgctaa 
a 



caagtagttt 


cttggcgtgc 


acgaattgca 


60 


agacattttg 


tgggcacctg 


ctagcacaga 


120 


gctaacctta 


cagctcagcc 


tgcacagaag 


180 


taaccttaca 


gctcagcctg 


cacagaagcg 


240 


ccttacagct 


cagcctgcac 


agaagcacag 


300 








301 



<210> 283 
<211> 301 
<212> DNA 
<213> Homo sapien 



<400> 283 
avcrgtatac ggcagacaaa 



cact ttgagg 
gt.gcat ctcc 
actt cccagg 
ggaaacatat 

g 



gctttataat 
agacat agta 
tttt atgcaa 
acat ttttaa 



ctttatarag 
aatatgctgc 
aggggttgct 
aaattttgtt 
aaat ctattt 



tgtagagagg 


tgagcgaaag 


gatgcaaaag 


60 


ttgaaaaaaa 


aaatgtgtag 


ttgatactca 


120 


ctgaccaatc 


aggtgatcat 


tttttctatc 


180 


aaattctata 


atggtgat at 


gcatctttta 


240 


tatgtaagaa 


ctgacagacg 


aattt gcttt 


300 








301 



<210> 284 

<211> 301 

<212> DNA 

<213> Homo sapien 



- /! n n ^ ^ Q /I 

caggtacaaa acgctattaa gtggcttaga atttgaacat ttgtggtctt tatttacttt 



60 
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gcttcgtgtg tgggcaaagc aacatcttcc ctaaatatat attaccaaga aaagcaagaa 120 

gcagattagg tttttgacaa aacaaacagg ccaaaagggg gctgacctgg agcagagcat 180 

ggtgagaggc aaggcatgag agggcaagtt tgttgtggac agatctgtgc ctactttatt 240 

actggagtaa aagaaaacaa agttcattga tgtcgaagga tatatacagt gttagaaatt 300 

^ 301 

<210> 285 
<211> 301 
<212> DNA 
<213> Homo sapien 

<220> 

<221> inisc_f eature 
<222> {!)... (301) 
<223> n = A,T,C or G 

<400> 285 

acatcaccat gatcggatcc cccacccatt atacgttgta tgtttacata aatactcttc 60 

aatgatcatt agtgttttaa aaaaaatact gaaaactcct tctgcatccc aatctctaac 120 

caggaaagca aatgctattt acagacctgc aagccctccc tcaaacnaaa ctatttctgg 180 

attaaatatg tctgacttct tttgaggtca cacgactagg caaatgctat ttacgatctg 240 

caaaagctgt ttgaagagtc aaagccccca tgtgaacacg atttctggac cctgtaacag 300 

^ 301 

<210> 286 
<211> 301 
<212> DNA 
<213> Homo sapien 

<400> 286 

taccactgca ttccagcctg ggtgacagag tgagactccg tctccaaaaa aaactttgct 60 

tgtatattat ttttgcctta cagtggatca ttctagtagg aaaggacagt aagatttttt 120 

atcaaaacgt gtcatgccag taagagargt: tatattcttt tctcatttct tccccaccca 180 

aaaataagct accatatagc ttataagtct caaatttttg ccttttacta aaatgtgatt 240 

gtttctgttc attgtgtatg cttcatcacc tatattaggc aaattccatt ttttcccttg 300 

^ 301 

<210> 287 
<211> 301 
<212> DNA 
<213> Homo sapien 

<400> 287 

tacagatctg ggaactaaat attaaaaatg agtgtggctg gatatatgga gaatgttggg 60 

cccagaagga acgtagagat cagatattac aacagctttg ttttgagggt tagaaatatg 120 

aaatgatttg gttatgaacg cacagtttag gcagcagggc cagaatcctg accctctgcc 180 

ccgtggttat ctcctcccca gcttggctgc ctcatgttat cacagtattc cattttgttt 240 

gttgcatgtc ttgtgaagcc atcaagattt tctcgtctgt tttcctctca ttggtaatgc 300 

^ * 301 

<210> 288 
<211> 301 
<212> DNA 
<213> Homo sapien 

<400> 288 

gtacacctaa ctgcaaggac agctgaggaa tgtaatgggc agccgctttt aaagaagtag 60 
agtcaatagg aagacaaatt ccagttccag ctcagtctgg gtatctgcaa agctgcaaaa 120 
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q^.xr[t\cia^ gacaatttca agagaatatt tccttaaagt tggcaatttg gagatcatac 180 
aaaagcatct qcttttgtqa tttaatztag ctcatctggc cactggaaga atccaaacag 240 
t c:: taa ^tttggatga atgcatgatg gaaattcaat aatttagaaa gttaaaaaaa 300 

301 



^;:10> 289 
<211> 301 
'■212> DNA 
<213> Homo sapien 

-220> 

<221> misc_f eature 
<222> (1) . . . (301) 
<223> n = A,T,C or G 



<400> 289 

ggtacactgt ttccatgtta tgtttctaca cattgctacc tcagtgctcc tggaaactta 60 

qcttttgatg tctccaagta gtccaccttc atttaactct ttgaaactgt atcatctttg 120 

ccaagtaaga gtggtggcct atttcagctg ctttgacaaa atgactggct cctgacttaa 180 

cgttctataa atgaatgtgc tgaagcaaag tgcccatggt ggcggcgaan aagagaaaga 240 

tgtgttttgt tttggactct ctgtggtccc ttccaatgct gtgggtttcc aaccagngga 300 



301 

<210> 290 
<211> 301 
<212> DNA 
<213> Homo sapien 

<220> 

<221> inisc_f eature 
<222> (1) .7. (301) 
<223> n = A,T,C or G 



<400> 290 

acdc-Lgagct cttcttgata aatatacaga atgcttggca tatacaagat tctatactac 60 

tgactgatct gttcatttct ctcacagctc ttacccccaa aagcttttcc accctaagtg 120 

t:totQacctc cttttctaat cacagtaggg atagaggcag anccacctac aatgaacatg 180 

gagitctatc aagaggcaga aacagcacag aatcccagtt ttaccattcg ctagcagtgc 240 

tgccttqaac aaaaacattt ctccatgtct cattttcttc atgcctcaag taacagtgag 300 

^ 301 



<210> 291 
<211> 301 
<212> DNA 
<213> Homo sapien 



<400> 291 

caggtaccaa tttcttctat cctagaaaca tttcatttta tgttgttgaa acataacaac 60 

tatatcagct agattttttt tctatgcttt acctgctatg gaaaatttga cacattctgc 120 

tttactcttt tgtttatagg tgaatcacaa aatgtatttt tatgtattct gtagttcaat 180 

agccatggct gtttacttca tttaatttat ttagcataaa gacattatga aaaggcctaa 240 

acatgagctt cacttcccca ctaactaatt agcatctgtt atttcttaac cgtaatgcct 300 

301 



<210> 292 
<211> 301 
<212> DNA 
<213> Homo sapien 
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<220> 

<221> inisc_f eature 
<222> (1) . . . (301) 
<223> n = A^T,C or G 



<400> 292 

accttttagt agtaatgtct aataataaat aagaaatcaa ttttataagg tccatatagc 60 

tgtattaaat aatttttaag tttaaaagat aaaataccat cattttaaat gttggtattc 120 

aaaaccaaag natataaccg aaaggaaaaa cagatgagac ataaaatgat ttgcnagatg 180 

ggaaatatag tasttyatga atgttnatta aattccagtt ataatagtgg ctacacactc 240 

tcactacaca cacagacccc acagtcctat atgccacaaa cacatttcca taacttgaaa 300 

3 301 



<210> 293 
<211> 301 
<212> DNA 
<213> Homo sapien 



<400> 293 

ggtaccaagt gctggtgcca gcctgttacc tgttctcact gaaaagtctg gctaatgctc 60 

ttgtgtagtc acttctgatt ctgacaatca atcaatcaat ggcctagagc actgactgtt 120 

aacacaaacg tcactagcaa agtagcaaca gctttaagtc taaatacaaa gctgttctgt 180 

gtgagaattt tttaaaaggc tacttgtata ataacccttg tcatttttaa tgtacctcgg 240 

ccgcgaccac gctaagccga attctgcaga tatccatcac actggcggcc gctcgagcat 300 

g 301 



<210> 294 
<211> 301 
<212> DNA 
<213> Homo sapien 

<2 20> 

<221> misc_feature 
<222> (1) . . . (301) 
<223> n = A,T,C or G 



<400> 294 

tgacccataa caatatacac tagctatctt tttaactgtc catcattagc accaatgaag 60 

attcaataaa attaccttta ttcacacatc tcaaaacaat tctgcaaatt cttagtgaag 120 

tttaactata gtcacaganc ttaaatattc acattgtttt ctatgtctac tgaaaataag 180 

ttcactactt ttctgggata ttctttacaa aatcttatta aaattcctgg tattatcacc 240 

cccaattata cagtagcaca accaccttat gtagttttta catgatagct ctgtagaggt 300 

301 



<210> 295 
<211> 305 
<212> DNA 
<213> Homo sapien 



<400> 295 

gtactctttc tctcccctcc tctgaattta attctttcaa cttgcaattt gcaaggatta 60 

cacatttcac tgtgatgtat attgtgttgc aaaaaaaaaa gtgtctttgt ttaaaattac 120 

ttggtttgtg aatccatctt gctttttccc cattggaact agtcattaac ccatctctga 180 

actggtagaa aaacrtctga agagctagtc tatcagcatc tgacaggtga attggatggt 240 

tctcagaacc atttcaccca gacagcctgt ttctatcctg tttaataaat tagtttgggt 300 

tctct 305 
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<-^10> 296 
<2i:> 301 
<212> DNA 
<213> Homo sapien 



<400> 296 

ciggtac-^atg ggaagctgcr aaaataatat ttgatagtaa aagratgtaa tqtqctatcz 60 

cacc-caqtag taaactaaaa ataaactgaa actttatgqa atctgaaatt attttccttg 120 

--.r t uaataga attaataaac caatatgagg aaacatgaaa ccatgcaatc tactatcaac 180 

rtrqaaaaag tgattgaacg aaccacttag ctttcagatg atgaacactg ataagtcatt 240 

vJtcatt act ataaatttta aaatctgtta ataagatggc ctatagggag gaaaaagggg 300 

c 301 



<210> 297 
<211> 300 
<212> DNA 
<213> Homo sapien 

<220> 

<221> mi sc_f eature 
<222> (1) ... (300) 
<223> n = A,T,C or G 



<400> 297 

actgagtttt aactggacgc caagcaggca aggctggaag gttttgctct ctttgtgcta 60 

aaggttttga aaaccttgaa ggagaatcat tttgacaaga agtacttaag agtctagaga 120 

acaaagangt gaaccagctg aaagctctcg ggggaanctt acatgtgttg ttaggcctgt 180 

tccatcattg ggagtgcact ggccatccct caaaatttgt ctgggctggc ctgagtggtc 240 

accgcacctc ggccgcgacc acgctaagcc gaattctgca gatatccatc acactggcgg 300 



<210> 298 
<211> 301 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (301) 
<223> n == A, C or G 



<400> 298 

tatggggttt gtcacccaaa agctgatgct gagaaaggcc tccctggggc ccctcccgcg 60 

ggcatctgag agacctggtg ttccagtgtt tctggaaatg ggtcccagtg ccgccggctg 120 

tgaaqctctc agatcaatca cgggaagggc ctggcggtgg tggccacctg gaaccaccct 180 

atcctgtctg tttacatttc actaycaggt tttctctggg cattacnatt tgttccccta 240 

caacagtgac ctgtgcattc tgctgtggcc tgctgtgtct gcaggtggct ctcagcgagg 300 

^ 301 



<210> 299 
<211> 301 
<212> DNA 
<213> Homo sapien 



<400> 299 

gttttgagac ggagtttcac tcttgttgcc cagactggac tgcaatggca gggtctctgc 60 

t oactgcacc ctctgcctcc caggttcgag caattctcct gcctcagcct cccaggtagc 120 

-3 ^ ^ (_ ^ ^ w wi^v^y avvi^cio<^^i^^a yL-,i„c3ciLL-cci_ u-LguciuuLLc d g u d g ci ga c g ±ow 

qcigtttcgcc atgttggcca gctggtctca aactcctgac ctcaagcgac ctgcctgcct 240 
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cggcctccca aagtgctgga attataggca tgagtcaaca cgcccagcct aaagatattt 300 
t 301 



<210> 300 
<211> 301 

<212> DNA 

<213> Homo sapien 

<400> 300 

attcagtttt atttgctgcc ccagtatctg taaccaggag 
tatgtcccac acccactggg aaaggctccc acctggctac 
gctgcattcc acaaggttct cagcctaatg agtttcacta 
gtaaagcaag accatgacat tcccccacgg aaatcagagt 
tataaagcct gcctctaaca gtccttgctt cttcacacca 

g 



tgccacaaaa 
ttcctctatc 
cctgccagtc 
ttgccccacc 
atcccgagcg 



tcttgccaga 
agctgggtca 
tcaaaactta 
gtcttgttac 
catcccccat 



60 
120 
180 
240 
300 
301 



<210> 301 
<211> 301 
<212> DNA 
<213> Homo sapien 



<40O> 301 
ttaaattttt gagaggataa aaaggacaaa 
agaggacccc aggtctccaa gcaaccacat 
gggaactcac aaagaccctc agagctgaga 
ctcagagctg agacacccac aacagtggga 
cacaacagca cctcgttcag ctgccacatg 
t 



taatctagaa 


atgtgtcttc 


ttcagtctgc 


60 


ggtcaagggc 


atgaataatt 


aaaagttggt 


120 


cacccacaac 


agtgggagct 


ca caaagacc 


180 


gctcacaaag 


accctcagag 


ctgagacacc 


240 


tgtgaataag 


gatgcaatgt 


ccagaagtgt 


300 








301 



<210> 302 
<211> 301 
<212> DNA 
<213> Homo sapien 

<400> 302 

aggtacacat ttagcttgtg gtaaatgact cacaaaactg attttaaaat 
tgaattttga aaattactac ttaatcctaa ttcacaataa caatggcatt 
ttgagttggt tcttagtatt atttatggta aataggctct taccacttgc 
ccacatcatt aatgactgac ttcccagtaa ggctctctaa ggggtaagta 
caggatttga gatgctaagg ccccagagat cgtttgatcc aaccctctta 

g 



caagttaatg 
aaggtttgac 
aaataactgg 
ggaggatcca 
ttttcagagg 



60 
120 
180 
240 
300 
301 



<210> 303 
<211> 301 
<212> DNA 
<213> Homo sapien 

<400> 303 

aggtaccaac tgtggaaata ggtagaggat cattttttct ttccatatca actaagttgt 
atattgtttt ttgacagttt aacacatctt cttctgtcag agattctttc acaatagcac 
tggctaatgg aactaccgct tgcatgttaa aaatggtggt ttgtgaaatg atcataggcc 
agtaacgggt atgtttttct aactgatctt ttgctcgttc caaagggacc tcaagacttc 
catcgatttt atatctgggg tctagaaaag gagttaatct gttttccctc ataaattcac 
c 

<210> 304 
<211> 301 
<212> DNA 



60 
120 
180 
240 
300 
301 
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<213> Homo sapien 



<400> 304 

acai-guatgt tattttgcag actgtcaacc tgaatttgta tttgcttgac attgcctaat 60 

tattagtttc agtttcagct tacccacttt ttgtctgcaa catgcaraas agacagtgcc 120 

cLLtttagtg tatcatatca ggaatcatct cacattggtt tgtgccatta ctggtgcaqt 180 

qacttr'-:agc cacttqggta aggtggagtt ggccatatgt ctccactgca aaattactga 240 

ctttccLttt gtaattaata agtgtgtgtg tgaagattct ttgagatgag gtatatatct 300 

301 



<2i0> 305 
<211> 301 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) ... {301) 
<223> n - A,T,C or G 



<400> 305 

gangtacagc gtggtcaagg taacaagaag aaaaaaatgt gagtggcatc ctgggatgag 60 

cagggggaca gacctggaca gacacgttgt catttgctgc tgtgggtagg aaaatgggcg 120 

'caaaggagga gaaacagata caaaatctcc aactcagtat taaggtattc tcatgcctag 180 

aatattggta gaaacaagaa tacattcata tggcaaataa ctaaccatgg tggaacaaaa 240 

ttctgggatt taagttggat accaangaaa ttgtattaaa agagctgttc atggaataag 300 

s 301 



<210> 306 

<211> 8 

<212> PRT 

<213> Homo sapien 

<400> 306 
Val Leu Gly Trp Val Ala Glu Leu 
1 5 

<210> 307 
<211> 637 
<212> DNA 
<213> Homo sapien 



<400> 307 

acagggratg aagggaaagg gagaggatga ggaagccccc ctggggattt ggtttggtcc 60 

ttgtgatcag gtggtctatg gggcttatcc ctacaaagaa gaatccagaa ataggggcac 120 

attgaggaat gatacttgag cccaaagagc attcaatcat tgttttattt gccttmtttt 180 

cacaccattg gtgagggagg gattaccacc ctggggttat gaagatggtt gaacacccca 240 

cacatagcac cggagatatg agatcaacag tttcttagcc atagagattc acagcccaga 300 

gcagqaggac gcttgcacac catgcaggat gacatggggg atgcgctcgg gattggtgtg 360 

aagaagcaag gactgttaga ggcaggcttt atagtaacaa gacggtgggg caaactctga 420 

tttccgtggg ggaatgtcat ggtcttgctt tactaagttt tgagactggc aggtagtgaa 480 

actcattagg ctgagaacct tgtggaatgc acttgaccca sctgatagag gaagtagcca 540 

qgtqggagcc tttcccagtg ggtgtgggac atatctggca agattttgtg gcactcctgg 600 

ttacagatac tggggcagca aataaaactg aatcttg 637 



<210> 308 
<211> 647 
<212> DNA 
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<213> Homo sapien 
<220> 

<221> inisc_f eature 
<222> (1) . . . (647) 

<223> n - A, T,C or G 

<400> 308 

acgattttca ttatcatgta aatcgggtca ctcaaggggc caaccacagc tgggagccac 60 

tgctcagggg aaggttcata tgggactttc tactgcccaa ggttctatac aggatataaa 120 

ggngcctcac agtatagatc tggtagcaaa gaagaagaaa caaacactga tctctttctg 180 

ccacccctct gaccctttgg aactcctctg accctttaga acaagcctac ctaatatctg 240 

ctagagaaaa gaccaacaac ggcctcaaag gatctcttac catgaaggtc tcagctaatt 300 

cttggctaag atgtgggttc cacattaggt tctgaatatg gggggaaggg tcaatttgct 360 

cattttgtgt gtggataaag tcaggatgcc caggggccag agcagggggc tgcttgcttt 420 

gggaacaatg gctgagcata taaccatagg ttatggggaa caaaacaaca tcaaagtcac 480 

tgtatcaatt gccatgaaga cttgagggac ctgaatctac cgattcatct taaggcagca 540 

ggaccagttt gagtggcaac aatgcagcag cagaatcaat ggaaacaaca gaatgattgc 600 

aatgtccttt tttttctcct gcttctgact tgataaaagg ggaccgt 647 

<210> 309 
<211> 460 
<212> DNA 
<213> Homo sapien 

<400> 309 

actttatagt ttaggctgga cattggaaaa aaaaaaaagc cagaacaaca tgtgatagat 60 

aatatgattg gctgcacact tccagactga tgaatgatga acgtgatgga ctattgtatg 120 

gagcacatct tcagcaagag ggggaaatac tcatcatttt tggccagcag ttgtttgatc 180 

accaaacatc atgccagaat actcagcaaa ccttcttagc tcttgagaag tcaaagtccg 240 

ggggaattta ttcctggcaa ttttaattgg actccttatg tgagagcagc ggctacccag 300 

ctggggtggt ggagcgaacc cgtcactagt ggacatgcag tggcagagct cctggtaacc 360 

acctagagga atacacaggc acatgtgtga tgccaagcgt gacacctgta gcactcaaat 420 

ttgtcttgtt tttgtctttc ggtgtgtaag attcttaagt 460 

<210> 310 
<211> 539 
<212> DNA 
<213> Homo sapien 

<400> 310 

acgggactta tcaaataaag ataggaaaag aagaaaactc aaatattata ggcagaaatg 60 

ctaaaggttt taaaatatgt caggattgga agaaggcatg gataaagaac aaagttcagt 120 

taggaaagag aaacacagaa ggaagagaca caataaaagt cattatgtat tctgtgagaa 180 

gtcagacagt aagatttgtg ggaaatgggt tggtttgttg tatggtatgt attttagcaa 240 

taatctttat ggcagagaaa gctaaaatcc tttagcttgc gtgaatgatc acttgctgaa 300 

ttcctcaagg taggcatgat gaaggagggt ttagaggaga cacagacaca atgaactgac 360 

ctagatagaa agccttagta tactcagcta ggaatagtga ttctgagggc acactgtgac 420 

atgattatgt cattacatgt atggtagtga tggggatgat aggaaggaag aacttatggc 480 

atattttcac ccccacaaaa gtcagttaaa tattgggaca ctaaccatcc aggtcaaga 539 

<210> 311 

<211> 526 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc feature 
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<222> 1 1 ^ . . . ( 526} 
<223> n - A, C or G 



<400> 311 

ca.^ritt t gag ccaatgacat agaattttac aaatcaagaa gcttattctg gggccatttc 60 

ttttgacgtt ttctctaaac tactaaagag gcattaatga tccataaatt atattatcta 120 

.-arrtacagc atttaaaatg tgttcagcat gaaatattag ctacagggga agctaaataa 180 

atraaacatg gaataaagat ttgtccttaa atataatcta caagaagact ttgatatttg 240 

rttttcacaa gtgaagcatt cttataaagt gtcataacct ttttggggaa actatgggaa 300 

aaaatgggga aactctgaag ggttttaagt atcttacctg aagctacaga ctccataacc 360 

^ Cuctttaca gggagctcct gcagccccta cagaaatgag tggctgagat tcttgattgc 420 

acagcaagag cttctcatct aaaccctttc cctttttagt atctgtgtat caagtataaa 480 

aattctataa acigtagtnt acttatttta atccccaaag cacagt 526 



<210> 312 

<211> 500 

<212> i:'r;A 

<213> Homo sap i en 
<220> 

<221> rr.:.sr_ feature 

<222> ( 1 ) . . . (500) 

<223> n - A,T,C or G 



<4D0> 312 

cctctctctc cccaccccct gactctagag aactgggttt tctcccagta ctccagcaat 60 

tcatttctga aagcagttga gccactttat tccaaagtac actgcagatg ttcaaactct 120 

ccatttctct ttcccttcca cctgccagtt ttgctgactc tcaacttgtc atgagtgtaa 180 

gcattaagga cattatgctt cttcgattct gaagacaggc cctgctcatg gatgactctg 240 

gcttcttagg aaaatatttt tcttccaaaa tcagtaggaa atctaaactt atcccctctt 300 

tgcagatgtc tagcagcttc agacatttgg ttaagaaccc atgggaaaaa aaaaaatcct 360 

tgctaatgtg gtttcctttg taaaccanga ttcttatttg nctggtatag aatatcagct 420 

ctgaacgtgt ggtaaagatt tttgtgtttg aatataggag aaatcagttt gctgaaaagt 480 

tagtcttaat tatctattgg 500 



<210> 313 
<211> 718 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (718) 
<223> n = A,T,C or G 



<400> 313 

ggagatttgt gtggtttgca gccgagggag accaggaaga tctgcatggt gggaaggacc 60 

tgatgataca gaggtgagaa ataagaaagg ctgctgactt taccatctga ggccacacat 120 

ctgctgaaat ggagataatt aacatcacta gaaacagcaa gatgacaata taatgtctaa 180 

gtagtgacat gtttttgcac atttccagcc cttttaaata tccacacaca caggaagcac 240 

aaaaggaagc acagagatcc ctgggagaaa tgcccggccg ccatcttggg tcatcgatga 300 

gcctcgccct gtgcctgntc ccgcttgtga gggaaggaca ttagaaaatg aattgatgtg 360 

ttccttaaag gatgqcagga aaacagatcc tgttgtggat atttatttga acgggattac 420 

agatttgaaa tqaagtcaca aagtgagcat taccaatgag aggaaaacag acgagaaaat 480 

cttgatggtt cacaagacat gcaacaaaca aaatggaata ctgtgatgac acgagcagcc 540 

aactggggag gagaT:accac ggggcagagg tcaggattct ggccctgctg cctaactgtg 600 

cgttatacca atcatttcta trtctaccct caaacaagct gtngaatatc tgacttacgg 660 

ttcttntggc ccacattttc atnatccacc ccntcntttt aannttantc caaantgt 718 
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<210> 314 
<211> 358 
<212> DNA 
<213> Homo sapien 

<400> 314 

gtttatttac attacagaaa aaacatcaag acaatgtata ctatttcaaa tatatccata 60 

cataatcaaa tatagctgta gtacatgttt tcattggtgt agattaccac aaatgcaagg 120 

caacatgtgt agatctcttg tcttattctt ttgtctataa tactgtattg tgtagtccaa 180 

gctctcggta gtccagccac tgtgaaacat gctcccttta gattaacctc gtggacgctc 240 

ttgttgtatt gctgaactgt agtgccctgt attttgcttc tgtctgtgaa ttctgttgct 300 

tctggggcat ttccttgtga tgcagaggac caccacacag atgacagcaa tctgaatt 358 

<210> 315 

<211> 341 

<212> DNA 

<213> Homo sapien 

<400> 315 

taccacctcc ccgctggcac tgatgagccg catcaccatg gtcaccagca ccatgaaggc 60 

ataggtgatg atgaggacat ggaatgggcc cccaaggatg gtctgtccaa agaagcgagt 120 

gacccccatt ctgaagatgt ctggaacctc taccagcagg atgatgatag ccccaatgac 180 

agtcaccagc tccccgacca gccggatatc gtccttaggg gtcatgtagg cttcctgaag 240 

tagcttctgc tgtaagaggg tgttgtcccg ggggctcgtg cggttattgg tcctgggctt 300 

gsgggggcgg tagatgcagc acatggtgaa gcagatgatg t 341 

<210> 316 
<211> 151 
<212> DNA 
<213> Homo sapien 

<400> 316 

agactgggca agactcttac gccccacact gcaatttggt cttgttgccg tatccattta 60 

tgtgggcctt tctcgagttt ctgattataa acaccactgg agcgatgtgt tgactggact 120 

cattcaggga gctctggttg caatattagt t 151 

<210> 317 

<211> 151 

<212> DNA 

<213> Homo sapien 

<400> 317 

agaactagtg gatcctaatg aaatacctga aacatatatt ggcatttatc aatggctcaa 60 

atcttcattt atctctggcc ttaaccctgg ctcctgaggc tgcggccagc agatcccagg 120 

ccagggctct gttcttgcca cacctgcttg a 151 

<210> 318 
<211> 151 
<212> DNA 
<213> Homo sapien 

<400> 318 

actggtggga ggcgctgttt agttggctgt tttcagaggg gtctttcgga gggacctcct 60 

gctgcaggct ggagtgtctt tattcctggc gggagaccgc acattccact gctgaggctg 120 

tgggggcggt ttatcaggca gtgataaaca t 151 

<210> 319 
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<211> 151 
<212> DNA 
<213> Homo sacien 



<400> 319 



ciactagtgga tccagagcta taggtacagt gtgatctcag ctttgcaaac acattttcta 
-^alaqatagt actaggtatt aatagatatg taaagaaaga aatcacacca ttaataatgg 
'^acigattggg tttatgtgat tttagtgggt a 



60 
120 
151 



<210> 320 
<211> 150 
<212> DNA 
<213> Homo sapien 

<400> 320 

aactagtgga tccactagtc cagtgtggtg gaattccatt gtgttggggt tctagatcgc 60 

gagcggctgc cctttttttt tttttttttg ggggggaatt tttttttttt aatagttatt 120 

gaatgttcta cagcttacag taaataccat 150 

<210> 321 
<211> 151 
<212> DNA 
<213> Homo sapien 

<400> 321 

agcaactttg tttttcatcc aggttatttt aggcttagga tttcctctca cactgcagtt 60 

tagggtggca ttgtaaccag ctatggcata ggtgttaacc aaaggctgag taaacatggg 120 

tgcctctgag aaatcaaagt cttcatacac t 151 

<210> 322 
<211> 151 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (151) 
<223> n = A, T,C or G 

<400> 322 

atccagcatc ttctcctgtt tcttgccttc ctttttcttc ttcttasatt ctgcttgagg 60 

tttgggcttg gtcagtttgc cacagggctt ggagatggtg acagtcttct ggcattcggc 120 

attgtgcagg gctcgcttca nacttccagt t 151 

<210> 323 
<211> 151 
<212> DNA 
<213> Hoino sapien 

<220> 

<221> misc _f eature 
<222> (1) .7. (151) 
<223> n A,T, C or G 

<400> 323 

i uaqgacttg tkttcttttt ctttattttt aatcctctta ckttgtaaat atattgccta 60 

r.ci gci ct cant tactacccag tttgtggttt twtgggagaa atgtaactgg acagttagct 120 

gttcaatyaa aaagacactt ancccatgtg g 151 
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<210> 324 
<211> 461 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (461) 
<223> n = A, T, C or G 



acctgt 
agaagt 
agagtt 
gcgaac 
ctcata 
cacaca 
gccacc 
aaaaac 



<400 
gtgg 

acta 
ct ca 
cagg 
aatg 
atgc 
gcac 



> 324 
aat ttcagct 
agctaaagga 
cqaatcccat 
ct t ctagact 
gat atcaaaa 
caa tagt tgg 
accatggca t 
aagagcccct 



ttcctcatgc 
atccaggttg 
cttggttcca 
ttcacggtgg 
taccctttgt 
tcactgcatt 
gccagagttc 
gccctgccct 



aaaaggattt 
ttggttggac 
gctatatcac 
gacgaaacgg 
gctacccagg 
tttacctgaa 
aacactgttg 
agctgangca 



tgtatccccg 
tgttaatacc 
tgacagcatg 
gttcagaaac 
ccctggggaa 
ccaaagctaa 
ctcttgaaaa 
c 



gcctacttga 
tttgatgaaa 
gtagaagact 
tgccaggggc 
tcaggtgact 
acccggtgtt 
ttgggtctga 



60 
120 
180 
240 
300 
360 
420 
461 



<21C> 325 
<211> 400 
<212> DNA 
<213> Homo sapien 



<400 
acactgtttc 
tttgatgtct 
agtaagagtg 
tctataaatg 
gttttgtttt 
gtccctttrg 
ctggccaagc 



> 325 
catgttatgt 
ccaagtagt c 
gtggcctatt 
aatgtgctga 
ggactctctg 
catcgccaag 
aggctggttt 



ttctacacat 
caccttcatt 
tcagctgctt 
agcaaagtgc 
tggtcccttc 
tgccataacc 
gcaagaatga 



tgctacct ca 
taactctttg 
tgacaaaatg 
ccatggtggc 
caatgctgtg 
atgagcacta 
aatgaatgat 



gtgctcctgg 
aaactgtatc 
actggctcct 
ggcgaagaag 
ggtttccaac 
cgctaccatg 



aaacttagct 
atctttgcca 
gacttaacgt 
agaaagatgt 
caggggaagg 
gttctgcctc 



60 
120 
180 
240 
300 
360 
400 



<210> 326 

<211> 1215 

<212> DNA 

<213> Homo sapien 



<400 
ggaggactgc 
gttctgctcg 
gaactcctac 
ccagatggtg 
taacgacctc 
catcagcatt 
tctgctggcg 
tgaggaggtc 
cggagggcaa 
cgggtacttg 
aggtgtctac 
ttaactctgg 
tcaggaatat 
ctcctccctc 
agacccccca 
ggagtccaga 
cctccctcag 



> 326 
agcccgcact 
ggcgtcctgg 
accatcgggc 
gaggccagcc 
atgctcatca 
gcttcgcagt 
aacggcagaa 
tgcagtaagc 
gaccagaagg 
cagggccttg 
accaacctct 
ggactgggaa 
ctgttcccag 
aaaccaaggg 
gcccctcctc 
ccccccagcc 
actcagaggt 



cgcagccctg 
tgcatccgca 
tgggcctgca 
t ctccgtacg 
agttggacga 
gccctaccgc 
tgcctaccgt 
tctatgaccc 
actcctgcaa 
tgtctttcgg 
gcaaattcac 
cccatgaaat 
cccctcctcc 
tacagatccc 
cctcagaccc 
cctcctccct 
ccaagccccc 



gcaggcggca 
gtgggtgctg 
cagtcttgag 
gcacccagag 
atccgtgtcc 

ggggaactct 

gctgcagtgc 
gctgtaccac 
cggtgactct 
aaaagccccg 
tgagtggata 
tgacccccaa 
ctcaggccca 
cagcccctcc 
aggagtccag 
cagacccagg 
aacccctcct 



ctggtcatgg 
tcagccgcac 
gccgaccaag 
tacaacagac 
gagtctgaca 
tgcctcgttt 
gtgaacgtgt 
cccagcatgt 
ggggggcccc 
tgtggccaag 
gagaaaaccg 
atacatcctg 
ggagtccagg 
tccctcagac 
cccctcctcc 
ggtccaggcc 
tccccagacc 



aaaacgaatt 
actgtttcca 
agccagggag 
ccttgctcgc 
ccatccggag 
ctggctgggg 
<^ggtggtgtc 
tctgcgccgg 
tgatctgcaa 
ttggcgtgcc 
tccaggccag 
cggaaggaat 
ccccca.qccc 
ccaggagtcc 
ctcagaccca 
cccaacccct 
cagaggtcca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
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aQt cccaacc cctcctccct cagacccagc qgtccaatgc cacctagact ctccctgtac 1080 

acagtgcccc cttgtggcac gttgacccaa ccttaccagt tggtttttca ttttttgtcc 1140 

^lT,;jcccta gatccagaaa taaagtctaa gagaagcgca aaaaaaaaaa aaaaaaaaaa 1200 

v^^W'iaaaaaaaaaaa 1215 

<210> 327 

<211> 220 

<212> PRT 

<213> Hoir.o sapien 



<400> 327 



Giu 


Asp 


Cys 


Ser 


Pre 


His 


Ser 


Gin 


Pro 


Trp 


Gin 


Ala 


Ala 


Leu 


Val 


Met 


1 








5 










10 










15 




u 


Asn 


Glu 


Leu 
20 


Phe 


Cys 


Ser 


Gly 


Val 
25 


Leu 


Val 


His 


Pro 


Gin 
30 


Trp 


Val 


Leu 


Ser 


Ala 
35 


Ala 


His 


Cys 


Phe 


Gin 
40 


Asn 


Ser 


Tyr 


Thr 


He 
45 


Gly 


Leu 


Gly 


Leu 


His 
50 


Ser 


Leu 


Glu 


Ala 


Asp 
55 


Gin 


Glu 


Pro 


Gly 


Ser 
60 


Gin 


Met 


Val 


Glu 


Ala 


Ser 


Leu 


Ser 


Val 


Arg 


His 


Pro 


Glu 


Tyr 


Asn 


Arg 


Pro 


Leu 


Leu 


Ala 


65 










70 










75 










80 


Asn 


Asp 


I.eu 


Met 


Leu 
85 


He 


Lys 


Leu 


Asp 


Glu 
90 


Ser 


Val 


Ser 


Glu 


Ser 
95 


Asp 


^hr 


He 


Arg 


Ser 
100 


He 


Ser 


He 


Ala 


Ser 
105 


Gin 


Cys 


Pro 


Thr 


Ala 
110 


Gly 


Asn 


Ser 


Cys 


Leu 
115 


Val 


Ser 


Gly 


Trp 


Gly 

120 


Leu 


Leu 


Ala 


Asn 


Gly 

125 


Arg 


Met 


Pro 


Thr 


Val 
130 


Leu 


Gin 


Cys 


Val 


Asn 
135 


Val 


Ser 


Val 


Val 


Ser 
140 


Glu 


Glu 


Val 


Cys 


Ser 


Lys 


Leu 


Tyr 


Asp 


Pro 


Leu 


Tyr 


His 


Pro 


Ser 


Met 


Phe 


Cys 


Ala 


Gly 


145 










150 










155 










160 


Gly 


Gly 


Gin 


Asp 


Gin 
165 


Lys 


Asp 


Ser 


Cys 


Asn 
170 


Gly 


Asp 


Ser 


Gly 


Gly 
175 


Pro 


Leu 


He 


Cys 


Asn 

180 


Gly 


Tyr 


Leu 


Gin 


Gly 
185 


Leu 


Val 


Ser 


Phe 


Gly 
190 


Lys 


Ala 


Pro 


Cys 


Gly 
195 


Gin 


Val 


Gly 


Val 


Pro 
200 


Gly 


Val 


Tyr 


Thr 


Asn 
205 


Leu 


Cys 


Lys 


Phe 


Thr 

210 


Glu 


Trp 


He 


Glu 


Lys 
215 


Thr 


Val 


Gin 


Ala 


Ser 
220 











<210> 328 
<211> 234 
<212> DNA 
<213> Homo sapien 

<400> 328 

cgctcgtctc tggtagctgc agccaaatca taaacggcga ggactgcagc ccgcactcgc 60 

agccctggca ggcggcactg gtcatggaaa acgaattgtt ctgctcgggc gtcctggtgc 120 

atocgcagtg ggtgctgtca gccacacact gtttccagaa ctcctacacc atcgggctgg 180 

gcctgcacag tcttgaggcc gaccaagagc cagggagcca gatggtggag gcca 234 

<210> 329 

--^211> 77 

<212> PRT 

<213> Homo sapien 

-'400> 329 

Leu Val Ser Gly Ser Cys Ser Gin He He Asn Gly Glu Asp Cys Ser 
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15 10 15 

Pro His Ser Gin Pro Trp Gin Ala Ala Leu Val Met Glu Asn Glu Leu 

20 25 30 

Phe Cys Ser Gly Val Leu Val His Pro Gin Trp Val Leu Ser Ala Thr 

35 40 45 

His Cys Phe Gin Asn Ser Tyr Thr lie Gly Leu Gly Leu His Ser Leu 

^0 55 60 

Glu Ala Asp Gin Glu Pro Gly Ser Gin Met Val Glu Ala 
65 70 75 

<210> 330 

<211> 70 

<212> DNA 

<213> Homo sapien 

<400> 330 

cccaacacaa tggcccgatc ccatccctga ctccgccctc aggatcgctc gtctctggta 60 
gctgcagcca yg 

<210> 331 

<211> 22 

<212> PRT 

<213> Homo sapien 

<400> 331 

Gin His Asn Gly Pro lie Pro Ser Leu Thr Pro Pro Ser Gly Ser Leu 

15 10 15 

Val Ser Gly Ser Cys Ser 
20 

<210> 332 
<211> 2507 

<213> Homo sapien 

<400> 332 

tggtgccgct gcagccggca gagatggttg agctcatgtt cccgctgttg ctcctccttc 60 

tgcccttcct tctgtatatg gctgcgcccc aaatcaggaa aatgctgtcc agtggggtgt 120 

gtacatcaac tgttcagctt cctgggaaag tagttgtggt cacaggagct aatacaggta 180 

tcgggaagga gacagccaaa gagctggctc agagaggagc tcgagtatat ttagcttgcc 24 0 

gggatgtgga aaagggggaa ttggtggcca aagagatcca gaccacgaca gggaaccagc 300 

aggtgttggt gcggaaactg gacctgtctg atactaagtc tattcgagct tttgctaagg 360 

gcttcttagc tgaggaaaag cacctccacg ttttgatcaa caatgcagga gtgatgatgt 420 

gtccgtactc gaagacagca gatggctttg agatgcacat aggagtcaac cacttgggtc 480 

acttcctcct aacccatctg ctgctagaga aactaaagga atcagcccca tcaaggatag 540 

taaatgtgtc ttccctcgca catcacctgg gaaggatcca cttccataac ctgcagggcg 600 

agaaattcta caatgcaggc ctggcctact gtcacagcaa gctagccaac atcctcttca 660 

cccaggaact ggcccggaga ctaaaaggct ctggcgttac gacgtattct gtacaccctg 720 

gcacagtcca atctgaactg gttcggcact catctttcat gagatggatg tggtggcttt 780 

tctccttttt catcaagact cctcagcagg gagcccagac cagcctgcac tgtgccttaa 84 0 

cagaaggtct tgagattcta agtgggaatc atttcagtga ctgtcatgtg gcatgggtct 900 

ctgcccaagc tcgtaatgag actatagcaa ggcggctgtg ggacgtcagt tgtgacctgc 960 

tgggcctccc aatagactaa caggcagtgc cagttggacc caagagaaga ctgcagcaga 1020 

ctacacagta cttcttgtca aaatgattct ccttcaaggt tttcaaaacc tttagcacaa 1080 

agagagcaaa accttccagc cttgcctgct tggtgtccag ttaaaactca gtgtactgcc 114 0 

agattcgtct aaatgtctgt catgtccaga tttactttgc ttctgttact gccagagtta 1200 

ctagagatat cataatagga taagaagacc ctcatatgac ctgcacagct cattttcctt 1260 

Gtgaaagaaa ctactaccta ggagaatcta agctatagca gggatgattt atgcaaattt 1320 
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gaactagctt ctttgttcac aattcagttc ctcccaacca accagtcttc acttcaagag 1380 

qgccacactg caacctcagc ttaacatgaa taacaaagac tggctcagga gcagggctng 1440 

T'ccaggcatg gtggatcacc ggaggtcagt agttcaagac cagcctggcc aacatggtga 1500 

:-^3Coccacct ctactaaaaa ttgtgtatat ctttgtgtgt cttcctgttt atgtgtgcca 1560 

agggagtatt ttcacaaagt tcaaaacagc cacaataatc agagatggag caaaccagtg 1620 

ccatccagtc tttatgcaaa tgaaatgctg caaagggaag cagattctgt atatgttggt 1680 

:iactacccac caagagcaca tgggtagcag ggaagaagta aaaaaagaga aggagaatac 1740 

tggaagataa tgcacaaaat gaagggacta gttaaggatt aactagccct ttaaggatta 1800 

actagttaag gattaatagc aaaagayatt aaatatgcta acatagctat ggaggaattg 1860 

agggcaagca cccaggactg atgaggtctt aacaaaaacc agtgtggcaa aaaaaaaaaa 1920 

aar^iaaaaaa aaaaatccta aaaacaaaca aacaaaaaaa acaattcttc attcagaaaa 1980 

dttatcttag ggactgatat tggtaattat ggtcaattta ataatatttt ggggcatttc 2040 

cttacattgt cttgacaaga ttaaaatgtc tgtgccaaaa ttttgtattt tatttggaga 2100 

cttcttatca aaagtaatgc tgccaaagga agtctaagga attagtagtg ttcccatcac 2160 

ttgtttggag tgtgctattc taaaagattt tgatttcctg gaatgacaat tatattttaa 2220 

ctttggtggg ggaaagagtt ataggaccac agtcttcact tctgatactt gtaaattaat 2280 

cttttattgc acttgttttg accattaagc tatatgttta gaaatggtca ttttacggaa 2340 

aaattagaaa aattctgata atagtgcaga ataaatgaat taatgtttta cttaatttat 2400 

attgaactgt caatgacaaa taaaaattct ttttgattat tttttgtttt catttaccag 2460 

aataaaaacg taagaattaa aagtttgatt acaaaaaaaa aaaaaaa 2507 



<210> 333 

<211> 3030 

<212> DNA 

<213> Homo sapien 

<400> 333 

gcaggcgact tgcgagctgg gagcgattta aaacgctttg gattcccccg gcctgggtgg 60 

ggagagcgag ctgggtgccc cctagattcc ccgcccccgc acctcatgag ccgaccctcg 120 

gctccatgga gcccggcaat tatgccacct tggatggagc caaggatatc gaaggcttgc 180 

tgggagcggg aggggggcgg aatctggtcg cccactcccc tctgaccagc cacccagcgg 24 0 

cgcctacgct gatgcctgct gtcaactatg cccccttgga tctgccaggc tcggcggagc 300 

.^.^ccaaagca atgccaccca tgccctgggg tgccccaggg gacgtcccca gctcccgtgc 360 

cttatggtta ctttggaggc gggtactact cctgccgagt gtcccggagc tcgctgaaac 420 

cctgtgccca ggcagccacc ctggccgcgt accccgcgga gactcccacg gccggggaag 480 

agtaccccag ycgccccact gagtttgcct tctatccggg atatccggga acctaccagc 540 

ctatggccag ttacctggac gtgtctgtgg tgcagactct gggtgctcct ggagaaccgc 600 

gacatgactc cctgttgcct gtggacagtt accagtcttg ggctctcgct ggtggctgga 660 

acagccagat gtgttgccag ggagaacaga acccaccagg tcccttttgg aaggcagcat 720 

ttgcagactc cagcgggcag caccctcctg acgcctgcgc ctttcgtcgc ggccgcaaga 780 

aacgcattcc gtacagcaag gggcagttgc gggagctgga gcgggagtat gcggctaaca 840 

agttcatcac caaggacaag aggcgcaaga tctcggcagc caccagcctc tcggagcgcc 900 

agattaccat ctggtttcag aaccgccggg tcaaagagaa gaaggttctc gccaaggtga 960 

agaacagcgc taccccttaa gagatctcct tgcctgggtg ggaggagcga aagtgggggt 1020 

qLcctgggga gaccaggaac ctgccaagcc caggctgggg ccaaggactc tgctgagagg 1080 

cccctagaga caacaccctt cccaggccac tggctgctgg actgttcctc aggagcggcc 1140 

tgggtaccca gtatgtgcag ggagacggaa ccccatgtga cagcccactc caccagggtt 1200 

cccaaagaac ctggcccagt cataatcatt catcctgaca gtggcaataa tcacgataac 1260 

cagtactagc tgccatgatc gttagcctca tattttctat ctagagctct gtagagcact 1320 

ttagaaaccg ctttcatgaa ttgagctaat tatgaataaa tttggaaggc gatccctttg 1380 

cagggaagct ttctctcaga cccccttcca ttacacctct caccctggta acagcaggaa 1440 

gactgaggag aggggaacgg gcagattcgt tgtgtggctg tgatgtccgt ttagcatttt 1500 

rctcagctga cagctgggta ggtggacaat tgtagaggct gtctcttcct ccctccttgt 1560 

ccaccccata gggtgtaccc actggtcttg gaagcaccca tccttaatac gatgattttt 1620 

'-^tgtcgtgtg aaaatgaagc cagcaggctg cccctagtca gtccttcctt ccagagaaaa 1680 

.iyagatttga gaaagtgcct gggtaattca ccattaattt catcccccaa actctctgag 1740 

l-Cttccctta atatttctgg tggttctgac caaagcaggt catggtttgt. tgagcatttg 1800 

ggatcccagt gaagtagatg tttgtagcct tgcatactta gcccttccca ggcacaaacg 1860 
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gagtggcaga gtggtgccaa ccctgttttc ccagtccacg tagacagatt cacagtgcgg 1920 

aattctggaa gctggagaca gacgggctct ttgcagagcc gggactctga gagggacatg 1980 

agggcctctg cctctgtgtt cattctctga tgtcctgtac ctgggctcag tgcccggtgg 2040 

gactcatctc ctggccgcgc agcaaagcca gcgggttcgt gctggtcctt cctgcacctt 210O 

aggctggggg tggggggcct gccggcgcat tctccacgat tgagcgcaca ggcctgaagt 2160 

ctggacaacc cgcagaaccg aagctccgag cagcgggtcg gtggcgagta gtggggtcgg 2220 

tggcgagcag ttggtggtgg gccgcggccg ccactacctc gaggacattt ccctcccgga 2280 

gccagctctc ctagaaaccc cgcggcggcc gccgcagcca agtgtttatg gcccgcggtc 2340 

gggtgggatc ctagccctgt ctcctctcct gggaaggagt gagggtggga cgtgacttag 2400 

acacctacaa atctatttac caaagaggag cccgggactg agggaaaagg ccaaagagtg 24 60 

tgagtgcatg cggactgggg gttcagggga agaggacgag gaggaggaag atgaggtcga 2520 

tttcctgatt taaaaaatcg tccaagcccc gtggtccagc ttaaggtcct cggttacatg 2580 

cgccgctcag agcaggtcac tttctgcctt ccacgtcctc cttcaaggaa gccccatgtg 2640 

ggtagctttc aatatcgcag gttcttactc ctctgcctct ataagctcaa acccaccaac 2700 

gatcgggcaa gtaaaccccc tccctcgccg acttcggaac tggcgagagt tcagcgcaga 2760 

tgggcctgtg gggagggggc aagatagatg agggggagcg gcatggtgcg gggtgacccc 2820 

ttggagagag gaaaaaggcc acaagagggg ctgccaccgc cactaacgga gatggccctg 2880 

gtagagacct ttgggggtct ggaacctctg gactccccat gctctaactc ccacactctg 2940 

ctatcagaaa cttaaacttg aggattttct ctgtttttca ctcgcaataa aytcagagca 3000 

aacaaaaaaa aaaaaaaaaa aaaactcgag 3030 

<210> 334 

<211> 2417 

<212> DNA 

<213> Homo sapien 

<400> 334 

ggcggccgct ctagagctag tgggatcccc cgggctgcac gaattcggca cgagtgagtt 60 

ggagttttac ctgtattgtt ttaatttcaa caagcctgag gactagccac aaatgtaccc 120 

agtttacaaa tgaggaaaca ggtgcaaaaa ggttgttacc tgtcaaaggt cgtatgtggc 180 

agagccaaga tttgagccca gttatgtctg atgaacttag cctatgctct ttaaacttct 240 

gaatgctgac cattgaggat atctaaactt agatcaattg cattttccct ccaagactat 300 

ttacttatca atacaaraat accaccttta ccaatctatt gttttgatac gagactcaaa 360 

tatgccagat atatgtaaaa gcaacctaca agctctctaa tcatgctcac ctaaaagatt 420 

cccgggatct aataggctca aagaaacttc ttctagaaat ataaaagaga aaattggatt 4 80 

atgcaaaaat tcattattaa tttttttcat ccatccttta attcagcaaa catttatctg 540 

ttgttgactt tatgcagtat ggccttttaa ggattggggg acaggtgaag aacggggtgc 600 

cagaatgcat cctcctacta atgaggtcag tacacatttg cattttaaaa tgccctgtcc 660 

agctgggcat ggtggatcat gcctgtaatc tcaacattgg aaggccaagg caggaggatt 720 

gcttcagccc aggagttcaa gaccagcctg ggcaacatag aaagacccca tctctcaatc 780 

aatcaatcaa tgccctgtct ttgaaaataa aactctttaa gaaaggttta atgggcaggg 840 

tgtggtagct catgcctata atacagcact ttgggaggct gaggcaggag gatcacttta 900 

gcccagaagt tcaagaccag cctgggcaac aagtgacacc tcatctcaat tttttaataa 960 

aatgaataca tacataagga aagataaaaa gaaaagttta atgaaagaat acagtataaa 1020 

acaaatctct tggacctaaa agtatttttg ttcaagccaa atattgtgaa tcacctctct 1080 

gtgttgagga tacagaatat ctaagcccag gaaactgagc agaaagttca tgtactaact 1140 

aatcaacccg aggcaaggca aaaatgagac taactaatca atccgaggca aggggcaaat 1200 

tagacggaac ctgactctgg tctattaagc gacaactttc cctctgttgt atttttcttt 1260 

tattcaatgt aaaaggataa aaactctcta aaactaaaaa caatgtttgt caggagttac 1320 

aaaccatgac caactaatta tggggaatca taaaatatga ctgtatgaga tcttgatggt 1380 

ttacaaagtg tacccactgt taatcacttt aaacattaat gaacttaaaa atgaatttac 1440 

ggagattgga atgtttcttt cctgttgtat tagttggctc aggctgccat aacaaaatac 1500 

cacagactgg gaggcttaag taacagaaat tcatttctca cagttctggg ggctggaagt 1560 

ccacgatcaa ggtgcaggaa aggcaggctt cattctgagg cccctctctt ggctcacatg 1620 

tggccaccct cccactgcgt gctcacatga cctctttgtg ctcctggaaa gagggtgtgg 1680 

gggacagagg gaaagagaag gagagggaac tctctggtgt ctcgtctttc aaggacccta 1740 

acctgggcca ctttggccca ggcactgtgg ggtggggggt tgtggctgct ctgctctgag 1800 

tggccaagat aaagcaacag aaaaatgtcc aaagctgtgc agcaaagaca agccaccgaa 1860 
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cagggatctg ctcatcagrg tggggacctc caagtcggcc accctggagg caagccccca 1920 

cagagcccat gcaaggtggc agcagcagaa gaagggaatt gtccctgtcc ttqgcacatt 1980 

ccrcaccgac ctggtgatgc tggacactgc gargaatggt aatgtggatg agaatatgat 2040 

ggacrcccag aaaaggagac ccagctgctc aggtggctgc aaatcattac agccttcatc 2100 

ctggggagga actgggggcc tggttctggg tcagagagca gcccagtgag ggtgagagct 2160 

acagcctgtc ctgccagctg gatccccagt cccggtcaac cagtaatcaa ggctgagcag 2220 

J ■-cuiqcttc ccggagctgg tcttgggaag ccagccctqg ggtgagttgg ctccrgctgt 2280 

ggtactgaga caatattgtc ataaattcaa tgcgcccttg tatccctttt tcttttttat 2340 

ctgtctacat ctataatcac tatgcatact agtctttgtt agtgtttcta ttcmacttaa 2400 

tagagatatg ttatact 2417 



<210> 335 

<211> 2984 

<212> DNA 

<213> Homo sapien 



<400> 335 

atocctcctt ccccactctc ctttccagaa ggcacttggg gtcttatctg ttggactctg 60 

aaaacacttc aggcgccctt ccaaggcttc cccaaacccc taagcagccg cagaagcgct 120 

cccgagctgc cttctcccac actcaggtga tcgagttgga gaggaagttc agccatcaga 180 

agtacctgtc ggcccctgaa cgggcccacc tggccaagaa cctcaagctc acggagaccc 240 

aagtgaagat atggttccag aacagacgct ataagactaa gcgaaagcag ctctcctcgg 300 

agctgggaga cttggagaag cactcctctt tgccggccct gaaagaggag gccttctccc 360 

gggcctccct ggtctccgtg tataacagct atccttacta cccatacctg tactgcgtgg 420 

gcagctggag cccagctttt tggtaatgcc agctcaggtg acaaccatta tgatcaaaaa 480 

ctgccttccc cagggtgtct ctatgaaaag cacaaggggc caaggtcagg gagcaagagg 540 

tgtgcacacc aaagctattg gagatttgcg tggaaatctc asattcttca ctggtgagac 600 

aatgaaacaa cagagacagt gaaagtttta atacctaagt cattccccca gtgcatactg 660 

taggtcattt tttttgcttc tggctacctg tttgaagggg agagagggaa aatcaagtgg 720 

tattttccag cactttgtat gattttggat gagctgtaca cccaaggatt ctgttctgca 780 

actccatcct cctgtgtcac tgaatatcaa ctctgaaaga gcaaacctaa caggagaaag 840 

■jaoaaccagg atgaggatgt caccaactga attaaactta agtccagaag cctcctgttg 900 

qccttggaat atggccaagg ctctctctgt ccctgtaaaa gagaggggca aatagagagt 960 

ctccaagaga acgccctcat gctcagcaca tatttgcatg ggagggggag atgggtggga 1020 

ggagatgaaa atatcagctt ttcttattcc tttttattcc ttttaaaatg gtatgccaac 1080 

ttaagtattt acagggtggc ccaaatagaa caagatgcac tcgctgtgat tttaagacaa 1140 

gc';,gtataaa cagaactcca ctgcaagagg gggggccggg ccaggagaat ctccgcttgt 1200 

ccaagacagg ggcctaagga gggtctccac actgctgcta ggggctgttg cattttttta 1260 

ttagtagaaa gtggaaaggc ctcttctcaa cttttttccc ttgggctgga gaatttagaa 1320 

tcagaagttt cctggagttt tcaggctatc atatatactg tatcctgaaa ggcaacataa 1380 

ttcttccttc cctcctttta aaattttgtg ttcctttttg cagcaattac tcactaaagg 1440 

gcttcatttt agtccagatt tttagtctgg ctgcacctaa cttatgcctc gcttatttag 1500 

cccgagatct ggtctttttt tttttttttt tttttccgtc tccccaaagc tttatctgtc 1560 

t--tQacttttt aaaaaagttt gggggcagat tctgaattgg ctaaaagaca tgcattttta 1620 

Uf.iac;tagcaa ctcttatttc tttcctttaa aaatacatag cattaaatcc caaatcctat 1680 

ttaaagacct gacagcttga gaaggtcact actgcattta taggaccttc tggtggttct 1740 

gctgttacgt ttgaagtctg acaatccttg agaatctttg catgcagagg aggtaagagg 1800 

tattggattt tcacagagga agaacacagc gcagaatgaa gggccaggct tactgagctg 1860 

tccagtggag ggctcatggg tgggacatgg aaaagaaggc agcctaggcc ctggggagcc 1920 

cagtccactg agcaagcaag ggactgagtg agccttttgc aggaaaaggc taagaaaaag 1980 

gaaaaccatt ctaaaacaca acaagaaact gtccaaatgc tttgggaact gtgtttattg 2040 

cctataatgg gtccccaaaa tgggtaacct agacttcaga gagaatgagc agagagcaaa 2100 

ggagaaatct ggctgtcctt ccattttcat tctgttatct caggtgagct ggtagagggg 2160 

agacattaga aaaaaatgaa acaacaaaac aattactaat gaggtacgct gaggcctggg 2220 

rig'.ctcttga ctccactact taattccgtt tagtgagaaa cctttcaatt ttcttttatt 2280 

n.. :4^;jqgcca gcttactgtt ggtggcaaaa ttgccaacat aagttaatag aaagttggcc 2340 

aatttcaccc cattttctgt ggtttgggct ccacattgca atgttcaatg ccacgtgctg 2400 

ctgacaccga ccggagtact agccagcaca aaaggcaggg tagcctgaat tgctttctgc 24 60 
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tctttacatt tcttttaaaa taagcattta gtgctcagtc cctactgagt actctttctc 2520 

tcccctcctc tgaatttaat tctttcaact tgcaatttgc aaggattaca catttcactg 2580 

tgatgtatat tgtgttgcaa aaaaaaaaaa aagtgtcttt gtttaaaatt acttggtttg 2640 

tgaatccatc ttgctttttc cccattggaa ctagtcatta acccatctct gaactggtag 2700 

aaaaacatct gaagagctag tctatcagca tctgacaggt gaattggatg gttctcagaa 27 60 

ccatttcacc cagacagcct gtttctatcc tgtttaataa attagtttgg gttctctaca 2820 

tgcataacaa accctgctcc aatctgtcac ataaaagtct gtgacttgaa gtttagtcag 2880 

cacccccacc aaactttatt tttctatgtg ttttttgcaa catatgagtg ttttgaaaat 2940 

aaagtaccca tgtctttatt agaaaaaaaa aaaaaaaaaa aaaa 2984 

<210> 336 
<211> 147 
<212> PRT 
<213> Homo sapien 

<400> 336 

Pro Ser Phe Pro Thr Leu Leu Ser Arg Arg His Leu Gly Ser Tyr Leu 

15 10 15 

Leu Asp Ser Glu Asn Thr Ser Gly Ala Leu Pro Arg Leu Pro Gin Thr 

20 25 30 

Pro Lys Gin Pro Gin Lys Arg Ser Arg Ala Ala Phe Ser His Thr Gin 

35 40 45 

Val lie Glu Leu Glu Arg Lys Phe Ser His Gin Lys Tyr Leu Ser Ala 

50 55 60 

Pro Glu Arg Ala His Leu Ala Lys Asn Leu Lys Leu Thr Glu Thr Gin 
^5 70 75 80 

Val Lys He Trp Phe Gin Asn Arg Arg Tyr Lys Thr Lys Arg Lys Gin 

85 90 95 

Leu Ser Ser Glu Leu Gly Asp Leu Glu Lys His Ser Ser Leu Pro Ala 

100 105 110 

Leu Lys Glu Glu Ala Phe Ser Arg Ala Ser Leu Val Ser Val Tyr Asn 

115 120 125 

Ser Tyr Pro Tyr Tyr Pro Tyr Leu Tyr Cys Val Gly Ser Trp Ser Pro 

130 135 140 

Ala Phe Trp 
145 

<210> 337 

<211> 9 

<212> PRT 

<213> Homo sapien 

<400> 337 

Ala Leu Thr Gly Phe Thr Phe Ser Ala 
1 5 

<210> 338 

<211> 9 

<212> PRT 

<213> Homo sapien 

<400> 338 

Leu Leu Ala Asn Asp Leu Met Leu He 
1 5 



<210> 339 
<211> 318 
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<212> PRT 

<213> Homo sapien 



<400> 339 



Met: 


Val 


Glu 


Leu 


Met 


Phe 


Pro 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Pro 


Phe 


Leu 


1 








5 










10 










15 




i eu 


Tyr 


Met 


Ala 


Ala 


Pro 


Gin 


He 


Arg 


Lys 


Met 


Leu 


Ser 


Ser 


Gly 


Val 








20 










25 










30 






Cvs 


Thr 


Ser 


Thr 


Val 


Gin 


Leu 


Pro 


Gly 


Lys 


Val 


Val 


Val 


Val 


Thr 


Gly 






35 










40 










45 








Ala 


Asn 


Thr 


Gly 


He 


Gly 


Lys 


Glu 


Thr 


A.la 


Lys 


Glu 


Leu 


Ala 


Gin 


Arg 




50 










55 










60 










Gly 


Ala 


Arg 


Val 


Tyr 


Leu 


Ala 


Cys 


Arg 


Asp 


Val 


Glu 


Lys 


Gly 


Glu 


Leu 


65 










70 










75 










80 


Val 


Ala 


Lys 


Glu 


He 


Gin 


Thr 


Thr 


Thr 


Gly 


Asn 


Gin 


Gin 


Val 


Leu 


Val 










85 










90 










95 




Arg 


Lys 


Leu 


Asp 


Leu 


Ser 


Asp 


Thr 


Lys 


Ser 


He 


A.rg 


Ala 


Phe 


Ala 


Lys 








100 










105 










110 






Gly 


Phe 


Leu 


Ala 


Glu 


Glu 


Lys 


His 


Leu 


His 


Val 


Leu 


He 


Asn 


Asn 


Ala 






115 










120 










125 








Gly 


Val 


Met 


Met 


Cys 


Pro 


Tyr 


Ser 


Lys 


Thr 


Ala 


Asp 


Gly 


Phe 


Glu 


Met 




130 










135 










140 










His 


lie 


Gly 


Val 


Asn 


His 


Leu 


Gly 


His 


Phe 


Leu 


Leu 


Thr 


His 


Leu 


Leu 












150 










155 










160 


i.eu 


Glu 


Lys 


Leu 


Lys 


Glu 


Ser 


Ala 


Pro 


Ser 


Arg 


He 


Val 


Asn 


Val 


Ser 










165 










170 










175 




Ser 


Leu 


Ala 


His 


His 


Leu 


Gly 


Arg 


He 


His 


Phe 


His 


Asn 


Leu 


Gin 


Gly 








180 










185 










190 






Glu 


Lys 


Phe 


Tyr 


Asn 


Ala 


Gly 


Leu 


Ala 


Tyr 


Cys 


His 


Ser 


Lys 


Leu 


Ala 






195 










200 










205 








Asn 


He 


Leu 


Phe 


Thr 


Gin 


Glu 


Leu 


Ala 


Arg 


Arg 


Leu 


Lys 


Gly 


Ser 


Gly 




210 










215 










220 










Val 


Thr 


Thr 


Tyr 


Ser 


Val 


His 


Pro 


Gly 


Thr 


Val 


Gin 


Ser 


Glu 


Leu 


Val 


225 










230 










235 










240 


Arg 


His 


Ser 


Ser 


Phe 


Met 


Arg 


Trp 


Met 


Trp 


Trp 


Leu 


Phe 


Ser 


Phe 


Phe 










245 










250 










255 




lie 


Lys 


Thr 


Pro 


Gin 


Gin 


Gly 


Ala 


Gin 


Thr 


Ser 


Leu 


His 


Cys 


Ala 


Leu 








260 










265 










270 






Thr 


Glu 


Gly 


Leu 


Glu 


He 


Leu 


Ser 


Gly 


Asn 


His 


Phe 


Ser 


Asp 


Cys 


His 






275 










280 










285 








Val 


Ala 


Trp 


Val 


Ser 


Ala 


Gin 


Ala 


Arg 


Asn 


Glu 


Thr 


He 


Ala 


Arg 


Arg 




290 










295 










300 










Leu 


Trp 


Asp 


Val 


Ser 


Cys 


Asp 


Leu 


Leu 


Gly 


Leu 


Pro 


He 


Asp 







305 310 315 



<210> 340 

<211> 483 

<212> DNA 

<213> Homo sapien 

<400> 340 

gccgaggtct gccttcacac ggaggacacg agactgcttc ctcaagggct cctgcctgcc 60 

tggacactgg tgggaggcgc tgtttagttg gctgttttca gaggggtctt tcggagggac 120 

ctcctgctgc aggctggagt gtctttattc ctggcgggag accgcacatt ccactgctga 180 

ggttgtgggg gcggtttatc aggcagtgat aaacataaga tgtcatttcc ttgactccgg 240 

ccttcaattt tctctttggc tgacgacgga gtccgtggtg tcccgatgta actgacccct 300 

gcuccaaacg tgacatcact gatgctcttc tcgggggtgc tgatggcccg cttggtcacg 360 

tgctcaatct cgccattcga ctcttgctcc aaactgtatg aagacacctg actgcacgtt 420 
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ttttctgggc ttccagaatt taaagtgaaa ggcagcactc ctaagctccg actccgatgc 480 
ctg 433 

<210> 341 
<211> 344 

<212> DNA 

<213> Homo sapien 

<400> 341 

ctgctgctga gtcacagatt tcattataaa tagcctccct aaggaaaata cactgaatgc 60 

tatttttact aaccattcta tttttataga aatagctgag agtttctaaa ccaactctct 120 

gctgccttac aagtattaaa tattttactt ctttccataa agagtagctc aaaatatgca 180 

attaatttaa taatttctga tgatggtttt atctgcagta atatgtatat catctattag 240 

aatttactta atgaaaaact gaagagaaca aaatttgtaa ccactagcac ttaagtactc 300 

ctgattctta acattgtctt taatgaccac aagacaacca acag 344 

<210> 342 

<211> 592 

<212> DNA 

<213> Hoino sapien 

<400> 342 

acagcaaaaa agaaactgag aagcccaaty tgctttcttg ttaacatcca cttatccaac 60 

caatgtggaa acttcttata cttggttcca ttatgaagtt ggacaattgc tgctatcaca 120 

cctggcaggt aaaccaatgc caagagagtg atggaaacca ttggcaagac tttgttgatg 180 

accaggattg gaattttata aaaatattgt tgatgggaag ttgctaaagg gtgaattact 240 

tccctcagaa gagtgtaaag aaaagtcaga gatgctataa tagcagctat tttaattggc 300 

aagtgccact gtggaaagag ttcctgtgtg tgctgaagtt ctgaagggca gtcaaattca 360 

tcagcatggg ctgtttggtg caaatgcaaa agcacaggtc tttttagcat gctggtctct 4 20 

cccgtgtcct tatgcaaata atcgtcttct tctaaatttc tcctaggctt cattttccaa 4 80 

agttcttctt ggtttgtgat gtcttttctg ctttccatta attctataaa atagtatggc 540 

ttcagccacc cactcttcgc cttagcttga ccgtgagtct cggctgccgc tg 592 

<210> 343 
<211> 382 
<212> DNA 
<213> Homo sapien 

<400> 343 

ttcttgacct cctcctcctt caagctcaaa caccacctcc cttattcagg accggcactt 60 

cttaatgttt gtggctttct ctccagcctc tcttaggagg ggtaatggtg gagttggcat 120 

cttgtaactc tcctttctcc tttcttcccc tttctctgcc cgcctttccc atcctgctgt 180 

agacttcttg attgtcagtc tgtgtcacat ccagtgattg ttttggtttc tgttcccttt 240 

ctgactgccc aaggggctca gaaccccagc aatcccttcc tttcactacc ttcttttttg 300 

ggggtagttg gaagggactg aaattgtggg gggaaggtag gaggcacatc aataaagagg 360 

aaaccaccaa gctgaaaaaa aa 392 

<210> 344 
<211> 536 
<212> DNA 
<213> Homo sapien 



<400> 344 

ctgggcctga agctgtaggg taaatcagag gcaggcttct gagtgatgag agtcctgaga 60 

caataggcca cataaacttg gctggatgga acctcacaat aaggtggtca cctcttgttt 120 

gtttaggggg atgccaagga taaggccagc tcagttatat gaagagaagc agaacaaaca 180 

agtctttcag agaaatggat gcaatcagag tgggatcccg gtcacatcaa ggtcacactc 240 

caccttcatg tgcctgaatg gttgccaggt cagaaaaatc caccccttac gagtgcggct 300 
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LcQnccctat arcccccgcc cgcgtccctt rctccataaa attcttctta gtagctatta 360 

cctrcttatt a^trgatcra gaaattgccc rccrttracc cctaccatga gccctacaaa 420 

raaci-aacct accactaata gttatgtcat ccctcttatt aatcatcatc ctagccctaa 480 

^qgccta tgagtgacta caaaaaqgat tagactgagc cgaataacaa aaaaaa 536 



<210> 34b 
<211> 251 
<212> ONA 
<213> Homo sapien 



<-30C> 3-35 

accttttqaq gtctctctca ccacctccac agccaccgtc accgtgggat gtgctggatg 60 

tgaatgaagc ccccatcttt gtgcctcctg aaaagagagt ggaagtgtcc gaggactttg 120 

qcQigggcca gqaaatcaca tcctacactg cccaggagcc agacacattt atggaacaga 180 

aaataacata tcggatttgg agagacactg ccaactggct ggagattaat ccggacactg 240 

gtgccatttj r 251 



<21C^ 346 

<2ll^ 2 62 

<2i2^ PNA 

<213> !lomo sapien 



<221> rr.i sc__f ca t ure 
<222 > [!)... (282) 
<223> r. - A,T,C or G 



<400> 346 

cgcgtctctg acactgtgat catgacaggg gttcaaacag aaagtgcctg ggccctcctt 60 

ctaagtcttg ttaccaaaaa aaggaaaaag aaaagatctt ctcagttaca aattctggga 120 

agggagacta tacctggctc ttgccctaag tgagaggtct tccctcccgc accaaaaaat 180 

agaaaggctt tctatttcac tggcccaggt agggggaagg agagtaactt tgagtctgtg 240 

ggVctcattt cccaaggtgc cttcaatgct catnaaaacc aa 282 



<210> 347 
<211> 201 
<212> DNA 
<213> Homo sapien 

<220> 

<221> mi sc_f eature 
<222> (1) . . . (201) 
<223> n = A,T,C or G 



<400> 347 

acacacataa tattataaaa tgccatctaa ttggaaggag ctttctatca ttgcaagtca 60 

taaatataac ttttaaaana ntactancag cttttaccta ngctcctaaa tgcttgtaaa 120 

tctgagactg actggaccca cccagaccca gggcaaagat acatgttacc atatcatctt 180 

tataaagaat ttttttttgt c 201 



<210> 348 
<211> 251 
<212> DNA 
<213> Homo sapien 

<400> 348 

ctgttaatca caacatttgt gcatcacttg tgccaagtga gaaaatgttc taaaatcaca 60 
agagagaaca gtgccagaat gaaactgacc ctaagtccca ggtgcccctg ggcaggcaga 120 
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aggagacact cccagcatgg aggagggttt atcttttcat cctaggtcag gtctacaatg 180 
ggggaaggtt ttattataga actcccaaca gcccacctca ctcctgccac ccacccgatg 240 
gccctgcctc c 05^ 

<210> 349 

<211> 251 
<212> DNA 
<213> Homo sapien 

<400> 349 

taaaaatcaa gccatttaat tgtatctttg aaggtaaaca atatatggga gctggatcac 60 

aacccctgag gatgccagag ctatgggtcc agaacatggt gtggtattat caacagagtt 120 

cagaagggtc tgaactctac gtgttaccag agaacataat gcaattcatg cattccactt 180 

agcaattttg taaaatacca gaaacagacc ccaagagtct ttcaagatga ggaaaattca 240 

actcctggtt t 251 

<210> 350 
<211> 908 
<212> DNA 
<213> Homo sapien 

<400> 350 

ctggacactt tgcgagggct tttgctggct gctgctgctg cccgtcatgc tactcatcgt 60 

agcccgcccg gtgaagctcg ctgctttccc tacctcctta agtgactgcc aaacgcccac 120 

cggctggaat tgctctggtt atgatgacag agaaaatgat ctcttcctct gtgacaccaa 180 

cacctgtaaa tttgatgggg aatgtttaag aattggagac actgtgactt gcgtctgtca 240 

gttcaagtgc aacaatgact atgtgcctgt gtgtggctcc aatggggaga gctaccagaa 300 

tgagtgttac ctgcgacagg ctgcatgcaa acagcagagt gagatacttg tggtgtcaga 360 

aggatcatgt gccacagtcc atgaaggctc tggagaaact agtcaaaagg agacatccac 420 

ctgtgatatt tgccagtttg gtgcagaatg tgacgaagat gccgaggatg tctggtgtgt 480 

gtgtaatatt gactgttctc aaaccaactt caatcccctc tgcgcttctg atgggaaatc 540 

ttatgataat gcatgccaaa tcaaagaagc atcgtgtcag aaacaggaga aaattgaagt 600 

catgtutttg ggtcgatgtc aagataacac aactacaact actaagtctg aagatgggca 660 

ttatgcaaga acagattatg cagagaatgc taacaaatta gaagaaagtg ccagagaaca 720 

ccacatacct tgtccggaac attacaatgg cttctgcatg catgggaagt gtgagcattc 780 

tatcaatatg caggagccat cttgcaggtg tgatgctggt tatactggac aacactgtga 840 

aaaaaaggac tacagtgttc tatacgttgt tcccggtcct gtacgatttc agtatgtctt 900 
aatcgcag 

<210> 351 
<211> 472 
<212> DNA 
<213> Homo sapien 

<400> 351 

ccagttattt gcaagtggta agagcctatt taccataaat aatactaaga accaactcaa 60 

gtcaaacctt aatgccattg ttattgtgaa ttaggattaa gtagtaattt tcaaaattca 120 

cattaacttg attttaaaat cagwtttgyg agtcatttac cacaagctaa atgtgtacac 180 

tatgataaaa acaaccattg tattcctgtt tttctaaaca gtcctaattt ctaacactgt 240 

atatatcctt cgacatcaat gaactttgtt ttcttttact ccagtaataa agtaggcaca 300 

gatctgtcca caacaaactt gccctctcat gccttgcctc tcaccatgct ctgctccagg 360 

tcagccccct tttggcctgt ttgttttgtc aaaaacctaa tctgcttctt gcttttcttg 420 

gtaatatata tttagggaag atgttgcttt gcccacacac gaagcaaagt aa 472 

<210> 352 
<211> 251 
<212> DNA 
<213> Homo sapien 
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<400> 352 

jL^caaaacta azczctoqqg aatcaaacca gaaaagggca aggatcttag gcatcgtgga 60 

tglngataag gccaggtcaa tggctgcaag catgcagaga aagaggtaca tcggagcgtg 120 

caaqctgcgt tccgtcctta cgatgaagac cacgatgcag tttccaaaca ttgccactac 180 

atacatggaa aggaggggga agccaaccca gaaatgggct ttctctaatc ctgggatacc 240 

aataagcacaa 251 

<210> 353 
<211> 436 
<212> DNA 
<213> Homo sapien 

<400> 353 

tttttttttt tttttttttt ttttttacaa caatgcagtc atttatttat tgagtatgtg 60 

cacattatgg tattattact atactgatta tatttatcat gtgacttcta attaraaaat 120 

gtatccaaaa gcaaaacagc agatatacaa aattaaagag acagaagata gacattaaca 180 

gataaggcaa cttatacatt gacaatccaa atccaataca tttaaacatt tgggaaatga 240 

gqgggacaaa tggaagccar atcaaatttg tgtaaaacta ttcagtatgt ttcccttgct 300 

tcatgtctga raaggctctc ccttcaatgg ggatgacaaa ctccaaatgc cacacaaatg 360 

ttaacagaat actagattca cactggaacg ggggtaaaga agaaattatt ttctataaaa 420 

qggctcctaa tgtaqt 436 

<210> 354 
<211> 854 
<212> DNA 
<213> Homo sapien 

<400> 354 

ccttttctag ttcaccagtt ttctgcaagg atgctggtta gggagtgtct gcaggaggag 60 

caagtctgaa accaaatcta ggaaacatag gaaacgagcc aggcacaggg ctggtgggcc 120 

alcagggacc accctttggg ttgatatttt gcttaatctg catcttttga gtaagatcat 180 

ctggcagtag aagctgttct ccaggtacat ttctctagct catgtacaaa aacatcctga 240 

aggactttgt caggtgcctt gctaaaagcc agatgcgttc ggcacttcct tggtctgagg 300 

ttaattgcac acctacaggc actgggctca tgctttcaag tattttgtcc tcactttagg 360 

gtqagtgaaa gatccccatt ataggagcac ttgggagaga tcatataaaa gctgactctt 420 

gagracatgc agtaatgggg tagatgtgtg tggtgtgtct tcattcctgc aagggtgctt 480 

gttagggagt gtttccagga ggaacaagtc tgaaaccaat catgaaataa atggtaggtg 540 

tgaactggaa aactaattca aaagagagat cgtgatatca gtgtggttga tacaccttgg 600 

caatatggaa ggctctaatt tgcccatatt tgaaataata attcagcttt ttgtaataca 660 

aaataacaaa ggattgagaa tcatggtgtc taatgtataa aagacccagg aaacataaat 720 

atatcaactg cataaatgta aaatgcatgt gacccaagaa ggccccaaag tggcagacaa 780 

cattgtaccc attttccctt ccaaaatgtg agcggcgggc ctgctgcttt caaggctgtc 840 

acacgggatg tcag 854 

<210> 355 
<211> 676 
<212> DNA 
<213> Homo sapien 

<400> 355 

gaaattaagt atgagctaaa ttccctgtta aaacctctag gggtgacaga tctcttcaac 60 

■ :cJ c;qt:caaag ctgatctttc tggaatgtca ccaaccaagg gcctatattt atcaaaagcc 120 

citccdcaagt catacctgga tgtcagcgaa gagggcacgg aggcagcagc agccactggg 180 

qacagcatCG ctgtaaaaag cctaccaatg agagctcagt tcaaggcgaa ccaccccttc 240 

■:\ qttcttta taaggcacac tcataccaac acgatcctat tctgtggcaa gcttgcctct 300 

ccctaatcag atggggttga gtaaggctca gagttgcaga tgaggtgcag agacaatcct 360 

gtgactttcc cacggccaaa aagctgttca cacctcacgc acctctgtgc ctcagtttgc 420 
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480 
540 



tcatctgcaa aataggtcta ggatttcttc caaccatttc atgagttgtg aagctaaggc 
tttgttaatc atggaaaaag gtagacttat gcagaaagcc tttctggctt tcttatctgt 
ggtgtctcat ttgagtgctg tccagtgaca tgatcaagtc aatgagtaaa attttaaggg 600 
attagatttt cttgacttgt atgtatctgt gagatcttga ataagtgacc tgacatctct 660 
gcttaaagaa aaccag 



<210> 356 
<211> 574 
<212> DNA 
<2i3> Homo sapien 



<400> 356 

tttttttttt tttttcagga aaacattctc ttactttatt tgcatctcag caaaggttct 60 

catgtggcac ctgactggca tcaaaccaaa gttcgtaggc caacaaagat gggccactca 120 

caagcttccc atttgtagat ctcagtgcct atgagtatct gacacctgtt cctctcttca 180 

gtctcttagg -.cggcttaaa tctgtctcag gtgtgctaag agtgccagcc caaggJcggtc 240 

aaaagtccac daaactgcag tctttgctgg gatagtaagc caagcagtgc ctggacagca 300 

gagttctttt cttgggcaac agataaccag acaggactct aatcgtgctc ttattcaaca 360 

ttcttctgtc ^.ctgcctaga ctggaataaa aagccaatct ctctcgtggc acagggaagg 420 

agatacaagc tcgtttacat gtgatagatc taacaaaggc atctaccgaa gtctggtctg 480 

gatagacggc ajagggagct cttaggtcag cgctgctggt tggaggacat tcctgagtcc 540 

agctttgcag cctttqtgca acagtacttt ccca 574 

<210> 357 
<211> 393 
<212> DNA 
<213> Homo sapien 



<400> 357 

tttttttttt tttttttttt tttttttttt tacagaatat aratgcttta tcactgkact 60 
^^^^^^^^^^ '^cttgttcac tatacttaaa aatgcaccac tcataaatat ttaattcagc 120 

180 
240 
300 
360 



aagccacaac caaracttga ttttatcaac aaaaacccct aaatataaac aasaaaaaaa 
atagatataa ttattccagt ttttttaaaa cttaaaarat attccattgc cgaattaara 
araarataag tgttatatgg aaagaagggc attcaagcac actaaaraaa cctgaggkaa 
gcataatctg tacaaaatta aactgtcctt tttggcattt taacaaattt gcaacgktct 
tttttttctt tttctgtttt tttttttttt tac 393 

<210> 358 
<21I> 630 
<212> DNA 
<213> Homo sapien 

<400> 358 

acagggtaaa caggaggatc cttgctctca cggagcttac attctagcag gaggacaata 60 

ttaatgttta taggaaaatg atgagtttat gacaaaggaa gtagatagtg ttttacaaga 120 

gcatagagta gggaagctaa tccagcacag ggaggtcaca gagacatccc taaggaagtg 180 

gagtttaaac tgagagaagc aagtgcttaa actgaaggat gtgttgaaga agaagggaga 24 0 

gtagaacaat ttgggcagag ggaaccttat agaccctaag gtgggaaggt tcaaagaact 300 

gaaagagagc tagaacagct ggagccgttc tccggtgtaa agaggagtca aagagataag 360 

attaaagatg tgaagattaa gatcttggtg gcattcaggg attggcactt ctacaagaaa 420 

tcactgaagg gagtaatgtg acattacttt tcacttcagg atggccattc taactccagg 480 

gggtagactg gactaggtaa gactggaggc aggtagacct cttctaaggc ctgcgatagt 54 0 
gaaagacaaa aataagtggg gaaattcagg ggatagtgaa aatcagtagg acttaatgag 
caagccagag gttcctccac aacaaccagt 



<210> 359 
<211> 620 
<212> DNA 



600 
630 
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<213> Homo sapien 



<4C0> 359 

-^^sa jcattcc aaaatataca tctagagact aarrgtaaat gctcratagt gaagaagtaa 60 

taattaaaaa atgctactaa tatagaaaat ttataatcag aaaaataaat attcagggag 120 

ctcdccagaa gaataaagtg ctctgccagt tattaaagga ttactgctgg tgaattaaat 180 

-.T ggcattcc ccaagggaaa tagagagatt cttctggatt atgttcaata tttatttcac 240 

aggattaact gttttaggaa cagatataaa gcttcgccac ggaagagatg gacaaagcac 300 

ciaaqac:aaca tgatacctta ggaagcaaca ctaccctttc aggcataaaa tttggagaaa 360 

tqcc^acatta tgcttcatga ataatatgta gaaagaaggt ctgatgaaaa tgacatcctt 420 

.TcitOLaagat aactttataa gaattctggg tcaaataaaa ttctttgaag aaaacatcca 480 

aatgtcattg acttatcaaa tactatcttg gcatataacc tatgaaggca aaactaaaca 540 

aaC'j^aaagc tcacaccaaa caaaaccatc aacttatttt gtattctata acatacgaga 600 

o"! gt aaagat gtgacagtgt 620 



<210> 360 
<211> 431 
<212> DNA 
<213> Homo sapien 

<400> 360 
aa aa ci a saa a agccagaaca a cat gt gat a 
t.qcitgaatga tgaacgtgat ggactattgt 
La ^t:catcat ttttggccag cagttgtttg 
aaaccttctt agctcttgag aagtcaaagt 
tggactcctt atgtgagagc agcggctacc 
agtggacatg cagtggcaga gctcctggta 
tgatgccaag cgtgacacct gtagcactca 
agattcttag t 

<210> 361 

<211> 351 

<212> DNA 

<213> Homo sapien 



<400> 361 

Hcactgattt ccgatcaaaa gaatcatcat ctttaccttg acttttcagg gaattactga 60 

cjol f tctt ct cagaagatag ggcacagcca ttgccttggc ctcacttgaa gggtctgcat 120 

ttgggtcctc tggtctcttg ccaagtttcc cagccactcg agggagaaat atcgggaggt 180 

ttgacttcct ccggggcttt cccgagggct tcaccgtgag ccctgcggcc ctcagggctg 240 

caatcctgga ttcaatgtct gaaacctcgc tctctgcctg ctggacttct gaggccgtca 300 

ctgccactct gtcctccagc tctgacagct cctcatctgt ggtcctgttg t 351 



<210> 362 
<211> 463 
<212> DNA 
<2 13> Homo sapien 



<400> 362 

acttcatcag gccataatgg gtgcctcccg tgagaatcca agcacctttg gactgcgcga 60 

tgtagatgag ccggctgaag atcttgcgca tgcgcggctt cagggcgaag ttcttggcgc 120 

^cccggtcac agaaatgacc aggttgggtg ttttcaggtg ccagtgctgg gtcagcagct 180 

: ri ; /i.jaggat ttccgcgtcc gtgtcgcagg acagacgtat atacttccct ttcttcccca 240 

gtgtctcaaa ctgaatatcc ccaaaggcgt cggtaggaaa ttccttggtg tgtttcttgt 300 

aqtvccattt ctcactttgg ttgatctggg tgccttccat gtgctggctc tgggcatagc 360 

/.a .iC'ttgca cacattctcc ctgataagca cgatggtgtg gacaggaagg aaggatttca 420 

i^^^: ; :ctgc ttatggaaac tggtattgtt agcttaaata gac 463 



gataatatga ttggctgcac 
atggagcaca tcttcagcaa 
atcaccaaac atcatgccag 
ccgggggaat ttattcctgg 
cagctggggt ggtggagcga 
accacctaga ggaatacaca 
aatttgtctt gtttttgtct 



acttccagac 60 

gagggggaaa 120 

aatactcagc 180 

caattttaat 240 

acccgtcact 300 

ggcacatgtg 360 

ttcggtgtgt 420 



431 
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<210> 363 
<211> 653 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (653) 
<223> n = A,T,C or G 

<400> 363 

acccccgagt ncctgnctgg catactgnga acgaccaacg acacacccaa gctcggcctc 60 

ctcttggnga ttctgggtga catcttcatg aatggcaacc gtgccagwga ggctgtcctc 120 

tgggaggcac tacgcaagat gggactgcgt cctggggtga gacatcctct ccttggagat 180 

ctaacgaaac ttctcaccta tgagttgtaa agcagaaata cctgnactac agacgagtgc 240 

ccaacagcaa ccccccggaa gtatgagttc ctctrgggcc tccgttccta ccatgagasc 300 

tagcaagatg naagtgttga gantcattgc agaggttcag aaaagagacc cntcgtgact 360 

ggtctgcaca gttcatggag gctgcagatg aggccttgga tgctctggat gctgctgcag 420 

ctgaggccga agcccgggct gaagcaagaa cccgcatggg aattggagat gaggctgtgt 480 

ntgggccctg gagctgggat gacattgagt ttgagctgct gacctgggat gaggaaggag 540 

attttggaga tccntggtcc agaattccat ttaccttctg ggccagatac caccagaatg 600 

cccgctccag attccctcag acctttgccg gtcccattat tggtcstggt ggt 653 

<210> 364 

<211> 401 

<212> DNA 

<213> Homo sapien 

<400> 364 

actagaggaa agacgttaaa ccactctact accacttgtg gaactctcaa agggtaaatg 60 

acaaagccaa tgaatgactc taaaaacaat atttacattt aatggtttgt agacaataaa 120 

aaaacaaggt ggatagatct agaattgtaa cattttaaga aaaccatagc atttgacaga 180 

tgagaaagct caattataga tgcaaagtta taactaaact actatagtag taaagaaata 240 

catttcacac ccttcatata aattcactat cttggcttga ggcactccat aaaatgtatc 300 

acgtgcatag taaatcttta tatttgctat ggcgttgcac tagaggactt ggactgcaac 360 

aagtggatgc gcggaaaatg aaatcttctt caatagccca g 401 

<210> 365 
<211> 356 
<212> DNA 
<213> Homo sapien 

<400> 365 

ccagtgtcat atttgggctt aaaatttcaa gaagggcact tcaaatggct ttgcatttgc 60 

atgtttcagt gctagagcgt aggaatagac cctggcgtcc actgtgagat gttcttcagc 120 

taccagagca tcaagtctct gcagcaggtc attcttgggt aaagaaatga cttccacaaa 180 

ctctccatcc cctggctttg gcttcggcct tgcgttttcg gcatcatctc cgttaatggt 24 0 

gactgtcacg atgtgtatag tacagtttga caagcctggg tccatacaga ccgctggaga 300 

acattcggca atgtcccctt tgtagccagt ttcttcttcg agctcccgga gagcag 356 

<210> 366 

<211> 1851 

<212> DNA 

<213> Homo sapien 

<400> 366 

tcatcaccat tgccagcagc ggcaccgtta gtcaggtttt ctgggaatcc cacatgagta 60 
cttccgtgtt cttcattctt cttcaatagc cataaatctt ctagctctgg ctggctgttt 120 
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rcactrcctt taagccttug tgactcttcc tctgatgtca gctttaagtc ttgttctgga 180 

ttgctgtttt cagaagagat tttraacatc tgtttttctt tgtagtcaga aagtaactgg 240 

jaaattacat gatgatgact agaaacagca tactctctgg ccgtctttcc agatcttgag 300 

aagatacarc aacattttgc tcaagtagag ggctgactat acttgctgat ccacaacata 360 

cagcaagtat gagagcagtt cttccatatc tatccagcgc atttaaattc gcttttttcr 420 

tgattaaaaa tttcaccact tgctgttttt gctcatgtat accaagtagc agtggtgtga 480 

ac7ccatgctt gttttttgat tcgatatcag caccgtataa gagcagtgct ttggccatta 540 

atttatcttc attgtagaca gcatagtgta gagtggtatt tccatactca tctggaatat 600 

ttggatcagt gccatgttcc agcaacatta acgcacattc atcttcctgg cattgtacgg 660 

cctttgtcag agctgtcctc tttttgttgt caaggacatt aagttgacat cgtctgtcca 720 

gcacgagttt tactacttct gaattcccat tggcagaggc cagatgtaga gcagtcctct 780 

tttgcttgtc cctcttgttc acatccgtgt ccctgagcat gacgatgaga tcctttctgg 840 

ggactttacc ccaccaggca gctctgtgga gcttgtccag atcttctcca tggacgtggt 900 

acctgggatc catgaaggcg ctgtcatcgt agtctcccca agcgaccacg ttgctcttgc 960 

cyctcccctg cagcagggga agcagtggca gcaccacttg cacctcttgc tcccaagcgt 1020 

cttcacagag gagtcgttgt ggtctccaga agtgcccacg ttgctcttgc cgctccccct 1080 

gtccatccag ggaggaagaa atgcaggaaa tgaaagatgc atgcacgatg gtatactcct 1140 

cagccatcaa acttctggac agcaggtcac ttccagcaag gtggagaaag ctgtccaccc 1200 

acagaggatg agatccagaa accacaatat ccattcacaa acaaacactt ttcagccaga 1260 

cacaggtact gaaatcatgt catctgcggc aacatggtgg aacctaccca atcacacatc 1320 

aagagatgaa gacactgcag tatatctgca caacgtaata ctcttcatcc ataacaaaat 1380 

aatataattt tcctctggag ccatatggat gaactatgaa ggaagaactc cccgaagaag 1440 

ccagtcgcag agaagccaca ctgaagctct gtcctcagcc atcagcgcca cggacaggar 1500 

tgtgtttctt ccccagtgat gcagcctcaa gttatcccga agctgccgca gcacacggtg 1560 

qctcctgaga aacaccccag ctcttccggt ctaacacagg caagtcaata aatgtgataa 1620 

tcacataaac agaattaaaa gcaaagtcac ataagcatct caacagacac agaaaaggca 1680 

tttgacaaaa tccagcatcc ttgtatttat tgttgcagtt ctcagaggaa atgcttctaa 1740 

cttttcccca tttagtatta tgttggctgt gggcttgtca taggtggttt ttattacttt 1800 

aaggtatgtc ccttctatgc ctgttttgct gagggtttta attctcgtgc c 1851 

<210> 367 
<211> 668 
<212> DNA 
<213> Homo sapien 

<400> 367 

cttgagcttc caaataygga agactggccc ttacacasgt caatgttaaa atgaatgcat 60 

Ttcagtattt tgaagataaa attrgtagat ctataccttg ttttttgatt cgatatcagc 120 

accrtataag agcagtgctt tggccattaa tttatctttc attrtagaca gcrtagtgya 180 

gagtggtatt tccatactca tctggaatat ttggatcagt gccatgttcc agcaacatta 240 

acgcacattc atcttcctgg cattgtacgg cctgtcagta ttagacccaa aaacaaatta 300 

catatcttag gaattcaaaa taacattcca cagctttcac caactagtta tatttaaagg 360 

agaaaactca tttttatgcc atgtattgaa atcaaaccca cctcatgctg atatagttgg 420 

ctactgcata cctttatcag agctgtcctc tttttgttgt caaggacatt aagttgacat 480 

cgtctgtcca gcaggagttt tactacttct gaattcccat tggcagaggc cagatgtaga 540 

gcagtcctat gagagtgaga agacttttta ggaaattgta gtgcactagc tacagccata 600 

gcaatgattc atgtaactgc aaacactgaa tagcctgcta ttactctgcc ttcaaaaaaa 660 

aaaaaaaa ggg 

<210> 368 

<211> 1512 

<212> DNA 

<213> Homo sapien 



<400> 368 

qggtcgccca gggggsgcgt gggctttcct cgggtgggtg tgggttttcc ctgggtgggg 60 

^^yggct gggc trgaatcccc tgctggggtt ggcaggtttt ggctgggatt gacttttytc 120 

ttcaaacaga ttggaaaccc ggagttacct gctagttggt gaaactggtt ggtagacgcg 180 
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atctgttggc tactactggc ttctcctggc tgttaaaagc agatggtggt tgaggttgat 240 

tccatgccgg ctgcttcttc tgtgaagaag ccatttggtc tcaggagcaa gatgggcaag 300 

tggtgctgcc gttgcttccc ctgctgcagg gagagcggca agagcaacgt gggcacttct 360 

ggagaccacg acgactctgc tatgaagaca ctcaggagca agatgggcaa gtggtgccgc 420 

cactgcttcc cctgctgcag ggggagtggc aagagcaacg tgggcgcttc tggagaccac 480 

gacgaytctg ctatgaagac actcaggaac aagatgggca agtggtgctg ccactgcttc 540 

ccctgctgca gggggagcrg caagagcaag gtgggcgctt ggggagacta cgatgacagt 600 

gccttcatgg agcccaggta ccacgtccgt ggagaagatc tggacaagct ccacagagct 660 

gcctggtggg gtaaagtccc cagaaaggat ctcatcgtca tgctcaggga cactgacgtg 720 

aacaagaagg acaagcaaaa gaggactgct ctacatctgg cctctgccaa tgggaattca 780 

gaagtagtaa aactcstgct ggacagacga tgtcaactta atgtccttga caacaaaaag 840 

aggacagctc tgayaaaggc cgtacaatgc caggaagatg aatgtgcgtt aatgttgctg 900 

gaacatggca ctgatccaaa tattccagat gagtatggaa ataccactct rcactaygct 960 

rtctayaatg aagataaatt aatggccaaa gcactgctct tatayggtgc tgatatcgaa 1020 

tcaaaaaaca aggtatagat ctactaattt tatcttcaaa atactgaaat gcattcattt 1080 

taacattgac gtgtgtaagg gccagtcttc cgtatttgga agctcaagca taacttgaat 1140 

gaaaatattt tgaaatgacc taattatctm agactttatt ttaaatattg ttattttcaa 1200 

agaagcatta gagggtacag tttttttttt ttaaatgcac ttctggtaaa tacttttgtt 1260 

gaaaacactg aatttgtaaa aggtaatact tactattttt caatttttcc ctcctaggat 1320 

ttttttcccc taatgaatgt aagatggcaa aatttgccct gaaataggtt ttacatgaaa 1380 

actccaagaa aagttaaaca tgtttcagtg aatagagatc ctgctccttt ggcaagttcc 1440 

taaaaaacag taatagatac gaggtgatgc gcctgtcagt ggcaaggttt aagatatttc 1500 

tgatctcgtg cc 1512 

<210> 369 

<211> 1853 

<212> DNA 

<213> Homo sapien 

<400> 369 

gggtcgccca gggggsgcgt gggctttcct cgggtgggtg tgggttttcc ctgggtgggg 60 

tgggctgggc trgaatcccc tgctggggtt ggcaggtttt ggctgggatt gacttttytc 120 

Tvo/^im/^— 1 ^^_^„4-4 ^ 4- „ —4 ,X- 1- ^ ~r-X- -~ - ^ -_x,4- _j _ _ ., - „ r\ r\ 

^ ^ v^^vjoi. ^^^^ac*a^<^w y vjci-y c L.a wo U y <^ uay c gL. gticicK^ L.gg L C ggUdy d^^g (Jy XOU 

atctgttggc tactactggc ttctcctggc tgttaaaagc agatggtggt tgaggttgat 240 

tccatgccgg ctgcttcttc tgtgaagaag ccatttggtc tcaggagcaa gatgggcaag 300 

tggtgctgcc gttgcttccc ctgctgcagg gagagcggca agagcaacgt gggcacttct 360 

ggagaccacg acgactctgc tatgaagaca ctcaggagca agatgggcaa gtggtgccgc 420 

cactgcttcc cctgctgcag ggggagtggc aagagcaacg tgggcgcttc tggagaccac 480 

gacgaytctg ctatgaagac actcaggaac aagatgggca agtggtgctg ccactgcttc 540 

ccctgctgca gggggagcrg caagagcaag gtgggcgctt ggggagacta cgatgacagy 600 

gccttcatgg akcccaggta ccacgtccrt ggagaagatc tggacaagct ccacagagct 660 

gcctggtggg gtaaagtccc cagaaaggat ctcatcgtca tgctcaggga cackgaygtg 720 

aacaagargg acaagcaaaa gaggactgct ctacatctgg cctctgccaa tgggaattca 780 

gaagtagtaa aactcstgct ggacagacga tgtcaactta atgtccttga caacaaaaag 840 

aggacagctc tgayaaaggc cgtacaatgc caggaagatg aatgtgcgtt aatgttgctg 900 

gaacatggca ctgatccaaa tattccagat gagtatggaa ataccactct rcactaygct 960 

rtctayaatg aagataaatt aatggccaaa gcactgctct tatayggtgc tgatatcgaa 1020 

tcaaaaaaca agcatggcct cacaccactg ytacttggtr tacatgagca aaaacagcaa 1080 

gtsgtgaaat ttttaatyaa gaaaaaagcg aatttaaaat gcrctggata gatatggaag 1140 

ractgctctc atacttgctg tatgttgtgg atcagcaagt atagtcagcc ytctacttga 1200 

gcaaaatrtt gatgtatctt ctcaagatct ggaaagacgg ccagagagta tgctgtttct 1260 

agtcatcatc atgtaatttg ccagttactt tctgactaca aagaaaaaca gatgttaaaa 1320 

atctcttctg aaaacagcaa tccagaacaa gacttaaagc tgacatcaga ggaagagtca 1380 

caaaggctta aaggaagtga aaacagccag ccagaggcat ggaaactttt aaatttaaac 1440 

ttttggttta atgttttttt tttttgcctt aataatatta gatagtccca aatgaaatwa 1500 

GCtatgagac taggctttga gaatcaatag attctttttt taagaatctt ttggctagga 1560 

gcggtgtctc acgcctgtaa ttccagcacc ttgagaggct gaggtgggca gatcacgaga 1620 

tcaggagatc gagaccatcc tggctaacac ggtgaaaccc catctctact aaaaatacaa 1680 
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aaacttagct gggtgtggtg gcgggtgcct gtagtcccag ctactcagga rgctgaggca 1740 
ggagaatggc atgaacccgg gaggtggagg ttgcagtgag ccgagatccg ccactacact 1800 
ccaqcctggg tgacagagca agactctgtc tcaaaaaaaa aaaaaaaaaa aaa 1853 

<210> 370 

<211> 2184 

<212> DNA 

<213> Homo sapien 



<400> 370 

ggcacgagaa ttaaaaccct cagcaaaaca ggcatagaag ggacatacct taaagtaata 60 

aaaaccacct atgacaagcc cacagccaac ataatactaa atggggaaaa gttagaagca 120 

tttcctctga gaactgcaac aataaataca aggatgctgg attttgtcaa atgccttttc 180 

tgtgtctgtt gagatgctta tgtgactttg cttttaattc tgtttatgtg attatcacat 240 

Itattgactt gcctgtgtta gaccggaaga gctggggtgt ttctcaggag ccaccgtgtg 300 

ctgcggcagc ttcgggataa cttgaggctg catcactggg gaagaaacac aytcctgtcc 360 

gtggcgctga tggctgagga cagagcttca gtgtggcttc tctgcgactg gcttcttcgg 420 

ggagttcttc cttcatagtt catccatatg gctccagagg aaaattatat tattttgtta 480 

tggatgaaga gtattacgtt gtgcagatat actgcagtgt cttcatctct tgatgtgtga 540 

ttgggtaggt tccaccatgt tgccgcagat gacatgattt cagtacctgt gtctggctga 600 

aaagtgtttg tttgtgaatg gatattgtgg tttctggatc tcatcctctg tgggtggaca 660 

gctttctcca ccttgctgga agtgacctgc tgtccagaag tttgatggct gaggagtata 720 

ccatcgtgca tgcatctttc atttcctgca tttcttcctc cctggatgga cagggggagc 780 

ggcaagagca acgtgggcac ttctggagac cacaacgact cctctgtgaa gacgcttggg 840 

agcaagaggt gcaagtggtg ctgccactgc ttcccctgct gcaggggagc ggcaagagca 900 

acgtggtcgc ttggggagac tacgatgaca gcgccttcat ggatcccagg taccacgtcc 960 

atggagaaga tctggacaag ctccacagag ctgcctggtg gggtaaagtc cccagaaagg 1020 

atctcatcgt catgctcagg gacacggatg tgaacaagag ggacaagcaa aagaggactg 1080 

ctctacatct ggcctctgcc aatgggaatt cagaagtagt aaaactcgtg ctggacagac 1140 

gatgtcaact taatgtcctt gacaacaaaa agaggacagc tctgacaaag gccgtacaat 1200 

gccaggaaga tgaatgtgcg ttaatgttgc tggaacatgg cactgatcca aatattccag 1260 

atgagtatgg aaataccact ctacactatg ctgtctacaa tgaagataaa ttaatggcca 1320 

aagcactgct cttatacggt gctgatatcg aatcaaaaaa caagcatggc ctcacaccac 1380 

rgctacttgg tatacatgag caaaaacagc aagtggtgaa atttttaatc aagaaaaaag 1440 

cgaatttaaa tgcgctggat agatatggaa gaactgctct catacttgct gtatgttgtg 1500 

gatcagcaag tatagtcagc cctctacttg agcaaaatgt tgatgtatct tctcaagatc 1560 

V.ggaaagacg gccagagagt atgctgtttc tagtcatcat catgtaattt gccagttact 1620 

ttctgactac aaagaaaaac agatgttaaa aatctcttct gaaaacagca atccagaaca 1680 

agacttaaag ctgacatcag aggaagagtc acaaaggctt aaaggaagtg aaaacagcca 1740 

gccagaggca tggaaacttt taaatttaaa cttttggttt aatgtttttt ttttttgcct 1800 

taataatatt agatagtccc aaatgaaatw acctatgaga ctaggctttg agaatcaata 1860 

gattcttttt ttaagaatct tttggctagg agcggtgtct cacgcctgta attccagcac 1920 

cttgagaggc tgaggtgggc agatcacgag atcaggagat cgagaccatc ctggctaaca 1980 

cggtgaaacc ccatctctac taaaaataca aaaacttagc tgggtgtggt ggcgggtgcc 2040 

tgtagtccca gctactcagg argctgaggc aggagaatgg catgaacccg ggaggtggag 2100 

gttgcagtga gccgagatcc gccactacac tccagcctgg gtgacagagc aagactctgt 2160 

ctcaaaaaaa aaaaaaaaaa aaaa 2184 



<210> 371 

<211> 1855 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> {1} . . . (1855) 
<223> n = A, T, C or G 
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<400> 371 

tgcacgcatc ggccagtgtc tgtgccacgt acactgacgc 
cacgcgcacg ttgcacgcgc ggcagcggct tggctggctt 
gccgcccccg cataaccgtc agactggcct gtaacggctt 
cgtaacggct tggctgccct gtaacggctt gcacgtgcat 
ttggctggca tgtagccgct tggcttggct ttgcattytt 
tcttggattg acgcttcctc cttggatkga cgtttcctcc 
tcgcgttcct ttgctggact tgacctttty tctgctgggt 
gctgggtgtt ttctccgggg gggktkgccc ttcctggggt 
gggcgtgggc tttccccggg tgggtgtggg ttttcctggg 
atccccctgc tggggttggc agggattgac ttttttcttc 
gtaacntgct agttggtgaa actggttggt agacgcgatc 
ctggctgtta aaagcagatg gtggctgagg ttgattcaat 
agaagccatt tggtctcagg agcaagatgg gcaagtggtg 
cagggggagc ggcaagagca acgtgggcac ttctggagac 
gacgcttggg agcaagaggt gcaagtggtg ctgcccactg 
cggcaagagc aacgtggkcg cttggggaga ctacgatgac 
gtaccacgtc crtggagaag atctggacaa gctccacaga 
ccccagaaag gatctcatcg tcatgctcag ggacactgay 
aaagaggact gctctacatc tggcctctgc caatgggaat 
gctggacaga cgatgtcaac ttaatgtcct tgacaacaaa 
ggccgtacaa tgccaggaag atgaatgtgc gttaatgttg 
aaatattcca gatgagtatg gaaataccac tctacactat 
attaatggcc aaagcactgc tcttatacgg tgctgatatc 
gatctactaa ttttatcttc aaaatactga aatgcattca 
agggccagtc ttccgtattt ggaagctcaa gcataacttg 
acctaattat ctaagacttt attttaaata ttgttatttt 
cagttttttt tttttaaatg cacttctggt aaatactttt 
aaaaggtaat acttactatt tttcaatttt tccctcctag 
tgtaagatgg caaaatttgc cctgaaatag gttttacatg 
acatgtttca gtgaatagag atcctgctcc tttggcaagt 
tacgaggtga tgcgcctgtc agtggcaagg tttaagatat 



cccctgagat 

gtaacggctt 

gcaggcgcac 

gctgcacgcg 

tgctkggctk 

ttggatkgac 

ttggcattcc 

gggcgtgggk 

gtggggtggg 

aaacagattg 

tgctggtact 

gccggctgct 

cgccactgct 

cacaacgact 

cttcccctgc 

agcgccttca 

gctgcctggt 

gtgaacaaga 

tcagaagtag 

aagaggacag 

ctggaacatg 

gctgtctaca 

gaatcaaaaa 

ttttaacatt 

aatgaaaata 

caaagaagca 

gttgaaaaca 

gatttttttc 

aaa.actccaa 

tcctaaaaaa 

ttctgatctc 



gtgcacgccg 

gcacgcgcac 

gccgcacgcg 

cgttaacggc 

ggcgttgkty 

gtttcytyty 

tttggggtgg 

cgcccccagg 

ctgtgctggg 

gaaacccgga 

actgtttctc 

tcttctgtga 

tcccctgctg 

cctctgtgaa 

tgcaggggag 

tggakcccag 

ggggtaaagt 

rggacaagca 

taaaactcgt 

ctctgacaaa 

gcactgatcc 

atgaagataa 

acaaggtata 

gacgtgtgta 

ttttgaaatg 

ttagagggta 

ctgaatttgt 

ccctaatgaa 

gaaaagttaa 

cagtaataga 

gtgcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1855 



<210> 372 

<211> 1059 

<212> DNA 

<213> Homo sapien 



<400 
gcaacgtggg 
ggtgcaagtg 
gcgcttgrgg 
aagat ctgga 
atcgtcatgc 
catctggcct 
caacttaatg 
gaagatgaat 
tatggaaata 
ctgctcttat 
cttcaaaata 
atttggaagc 
ctttatttta 
aatgcacttc 
tatttttcaa 
ttgccctgaa 
agagatcctg 
tgtcagtggc 



> 372 
cacttctgga 
gtgctgccca 
agactmcgat 
caagctccac 
tcagggacac 
ctgccaatgg 
tccttgacaa 
gtgcgttaat 
ccactctrca 
ayggtgctga 
ctgaaatgca 
tcaagcataa 
aatattgtta 
tggtaaatac 
tttttccctc 
ataggtttta 
ctcctttggc 
aaggtttaag 



gaccacaacg 
ctgcttcccc 
gacagygcct 
agagctgccc 
tgaygtgaac 
gaattcagaa 
caaaaagagg 
gttgctggaa 
ctaygctrtc 
tatcgaatca 
ttcattttaa 
cttgaatgaa 
ttttcaaaga 
ttttgttgaa 
ctaggatttt 
catgaaaact 
aagttcctaa 
atatttctga 



actcctctgt 
tgctgcaggg 
tcatggagcc 
tggtggggta 
aagarggaca 
gtagtaaaac 
acagctctga 
catggcactg 
tayaatgaag 
aaaaacaagg 
cattgacgtg 
aatattttga 
agcattagag 
aacactgaat 
tttcccctaa 
ccaagaaaag 
aaaacagtaa 
tctcgtgcc 



gaagacgctt 
gagcggcaag 
caggtaccac 
aagtccccag 
agcaaaagag 
tcstgctgga 
yaaaggccgt 
atccaaatat 
ataaattaat 
tatagatcta 
tgtaagggcc 
aatgacctaa 
ggtacagttt 
ttgtaaaagg 
tgaatgtaag 
ttaaacatgt 
tagatacgag 



gggagcaaga 
agcaacgtgg 
gtccgtggag 
aaaggatctc 
gactgctcta 
cagacgatgt 
acaatgccag 
tccagatgag 
ggccaaagca 
ctaattttat 
agtcttccgt 
ttatctaaga 
ttttttttta 
taatacttac 
atggcaaaat 
ttcagtgaat 
gtgatgcgcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1059 
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<210> 373 

<2ll> 1155 

<212> DNA 

<213> Horao sapien 



<400> 373 

atggtggttg aggttgattc catgccggct gcctcttctg tgaagaagcc atttggtctc 60 

agjagcaaga tgggcaagtg gtgctgccgt tgcttcccct gctgcaggga gagcggcaag 120 

agcaacgtgg gcacttctgg agaccacgac gactctgcta tgaagacact caggagcaag 180 

dtgggcaagt ggtgccgcca ctgcttcccc tgctgcaggg ggagtggcaa gagcaacgtg 240 

gqcgcttctg gagaccacga cgactctgct atgaagacac tcaggaacaa gatgggcaag 300 

tggtgctgcc actgcttccc ctgctgcagg gggagcggca agagcaaggt gggcgcttgg 360 

ggagactacg atgacagtgc cttcatggag cccaggtacc acgtccgtgg agaagatctg 420 

gacaagctcc acagagctgc ctggtggggt aaagtcccca gaaaggatct catcgtcatg 480 

ctcagggaca ctgacgtgaa caagaaggac aagcaaaaga ggactgctct acatctggcc 540 

tctgccaatg ggaattcaga agtagtaaaa ctcctgctgg acagacgatg tcaacttaat 600 

gtccttgaca acaaaaagag gacagctctg ataaaggccg tacaatgcca ggaagatgaa 660 

tgtgcgttaa tgttgctgga acatggcact gatccaaata ttccagatga gtatggaaat 720 

accactctgc actacgctat ctataatgaa gataaattaa tggccaaagc actgctctta 780 

tatggtgctg atatcgaatc aaaaaacaag catggcctca caccactgtt acttggtgta 840 

catgagcaaa aacagcaagt cgtgaaattt ttaatcaaga aaaaagcgaa tttaaatgca 900 

ctggatagat atggaaggac tgctctcata cttgctgtat gttgtggatc agcaagtata 960 

gtcagccttc tacttgagca aaatattgat gtatcttctc aagatctatc tggacagacg 1020 

qccagagagt atgctgtttc tagtcatcat catgtaattt gccagttact ttctgactac 1080 

aaagaaaaac agatgctaaa aatctcttct gaaaacagca atccagaaaa tgtctcaaga 1140 

accagaaata aataa 1155 



<210> 374 

<211> 2000 

<212> DNA 

<213> Homo sapien 



<400> 374 

riiggtggttg aggttgattc catgccggct gcctcttctg tgaagaagcc atttggtctc 60 

aggagcaaga tgggcaagtg gtgctgccgt tgcttcccct gctgcaggga gagcggcaag 120 

agcaacgtgg gcacttctgg agaccacgac gactctgcta tgaagacact caggagcaag 180 

atgggcaagt ggtgccgcca ctgcttcccc tgctgcaggg ggagtggcaa gagcaacgtg 240 

ggcgcttctg gagaccacga cgactctgct atgaagacac tcaggaacaa gatgggcaag 300 

tggtgctgcc actgcttccc ctgctgcagg gggagcggca agagcaaggt gggcgcttgg 360 

ggagactacg atgacagtgc cttcatggag cccaggtacc acgtccgtgg agaagatctg 420 

gacaagctcc acagagctgc ctggtggggt aaagtcccca gaaaggatct catcgtcatg 4 80 

ctcagggaca ctgacgtgaa caagaaggac aagcaaaaga ggactgctct acatctggcc 540 

tctgccaatg ggaattcaga agtagtaaaa ctcctgctgg acagacgatg tcaacttaat 600 

gtccttgaca acaaaaagag gacagctctg ataaaggccg tacaatgcca ggaagatgaa 660 

"gtgcgttaa tgttgctgga acatggcact gatccaaata ttccagatga gtatggaaat 720 

accactctgc actacgctat ctataatgaa gataaattaa tggccaaagc actgctctta 780 

tatggtgctg atatcgaatc aaaaaacaag catggcctca caccactgtt acttggtgta 840 

catgagcaaa aacagcaagt cgtgaaattt ttaatcaaga aaaaagcgaa tttaaatgca 900 

ctggatagat atggaaggac tgctctcata cttgctgtat gttgtggatc agcaagtata 960 

gtcagccttc tacttgagca aaatattgat gtatcttctc aagatctatc tggacagacg 1020 

gccagagagt atgctgtttc tagtcatcat catgtaattt gccagttact ttctgactac 1080 

aaagaaaaac agatgctaaa aatctcttct gaaaacagca atccagaaca agacttaaag 1140 

ctgacatcag aggaagagtc acaaaggttc aaaggcagtg aaaatagcca gccagagaaa 1200 

atgtctcaag aaccagaaat aaataaggat ggtgatagag aggttgaaga agaaatgaag 1260 

aagcatgaaa gtaataatgt gggattacta gaaaacctga ctaatggtgt cactgctggc 1320 

aatggtgata atggattaat tcctcaaagg aagagcagaa cacctgaaaa tcagcaattt 1380 

cctgacaacg aaagtgaaga gtatcacaga atttgcgaat tagtttctga ctacaaagaa 1440 

aaacagatgc caaaatactc ttctgaaaac agcaacccag aacaagactt aaagctgaca 1500 
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tcagaggaag agtcacaaag gcttgagggc agtgaaaatg gccagccaga gctagaaaat 1560 

tttatggcta tcgaagaaat gaagaagcac ggaagtactc atgtcggatt cccagaaaac 1620 

ctgactaatg gtgccactgc tggcaatggt gatgatggat taattcctcc aaggaagagc 1680 

agaacacctg aaagccagca atttcctgac actgagaatg aagagtatca cagtgacgaa 1740 

caaaatgata ctcagaagca attttgtgaa gaacagaaca ctggaatatt acacgatgag 1800 

attctgattc atgaagaaaa gcagatagaa gtggttgaaa aaatgaattc tgagctttct 1860 

cttagttgta agaaagaaaa agacatcttg catgaaaata gtacgttgcg ggaagaaatt 1920 

gccatgctaa gactggagct agacacaatg aaacatcaga gccagctaaa aaaaaaaaaa 1980 

aaaaaaaaaa aaaaaaaaaa 2000 

<210> 375 

<211> 2040 

<212> DMA 

<213> Homo sapien 

<400> 375 

atggtggttg aggttgattc catgccggct gcctcttctg tgaagaagcc atttggtctc 60 

aggagcaaga tgggcaagtg gtgctgccgt tgcttcccct gctgcaggga gagcggcaag 120 

agcaacgtgg gcacttctgg agaccacgac gactctgcta tgaagacact caggagcaag 180 

atgggcaagt ggtgccgcca ctgcttcccc tgctgcaggg ggagtggcaa gagcaacgtg 24 0 

ggcgcttctg gagaccacga cgactctgct atgaagacac tcaggaacaa gatgggcaag 300 

tggtgctgcc actgcttccc ctgctgcagg gggagcggca agagcaaggt gggcgcttgg 360 

ggagactacg atgacagtgc cttcatggag cccaggtacc acgtccgtgg agaagatctg 420 

gacaagctcc acagagctgc ctggtggggt aaagtcccca gaaaggatct catcgtcatg 480 

ctcagggaca ctgacgtgaa caagaaggac aagcaaaaga ggactgctct acatctggcc 540 

tctgccaatg ggaattcaga agtagtaaaa ctcctgctgg acagacgatg tcaacttaat 600 

gtccttgaca acaaaaagag gacagctctg ataaaggccg tacaatgcca ggaagatgaa 660 

tgtgcgttaa tgttgctgga acatggcact gatccaaata ttccagatga gtatggaaat 720 

accactctgc actacgctat ctataatgaa gataaattaa tggccaaagc actgctctta 780 

tatggtgctg atatcgaatc aaaaaacaag catggcctca caccactgtt acttggtgta 840 

catgagcaaa aacagcaagt cgtgaaattt ttaatcaaga aaaaagcgaa tttaaatgca 900 

ctggatagat atggaaggac tgctctcata cttgctgtat gttgtggatc agcaagtata 960 

L,^-ayc,a cidca ucii, ogd u g l. dt cl t_ c L u ctagatctatc tggacagacg lOZv 

gccagagagt atgctgtttc tagtcatcat catgtaattt gccagttact ttctgactac 1080 

aaagaaaaac agatgctaaa aatctcttct gaaaacagca atccagaaca agacttaaag 1140 

ctgacatcag aggaagagtc acaaaggttc aaaggcagtg aaaatagcca gccagagaaa 1200 

atgtctcaag aaccagaaat aaataaggat ggtgatagag aggttgaaga agaaatgaag 1260 

aagcatgaaa gtaataatgt gggattacta gaaaacctga ctaatggtgt cactgctggc 1320 

aatggtgata atggattaat tcctcaaagg aagagcagaa cacctgaaaa tcagcaattt 1380 

cctgacaacg aaagtgaaga gtatcacaga atttgcgaat tagtttctga ctacaaagaa 14 4 0 

aaacagatgc caaaatactc ttctgaaaac agcaacccag aacaagactt aaagctgaca 1500 

tcagaggaag agtcacaaag gcttgagggc agtgaaaatg gccagccaga gaaaagatct 1560 

caagaaccag aaataaataa ggatggtgat agagagctag aaaattttat ggctatcgaa 1620 

gaaatgaaga agcacggaag tactcatgtc ggattcccag aaaacctgac taatggtgcc 1680 

actgctggca atggtgatga tggattaatt cctccaagga agagcagaac acctgaaagc 1740 

cagcaatttc ctgacactga gaatgaagag tatcacagtg acgaacaaaa tgatactcag 1800 

aagcaatttt gtgaagaaca gaacactgga atattacacg atgagattct gattcatgaa 1860 

gaaaagcaga tagaagtggt tgaaaaaatg aattctgagc tttctcttag ttgtaagaaa 1920 

gaaaaagaca tcttgcatga aaatagtacg ttgcgggaag aaattgccat gctaagactg 1980 

gagctagaca caatgaaaca tcagagccag ctaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2040 

<210> 376 
<211> 329 
<212> PRT 
<213> Homo sapien 

<400> 376 

Met Asp He Val Val Ser Gly Ser His Pro Leu Trp Val Asp Ser Phe 
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1 


















10 










15 




Leu 


His 


Leu 




Gly 


Ser 


Asp 


Leu 


Leu 

25 


Ser 


Arg 


Ser 


Leu 


Met 

3C 


Ala 


Glu 




Tyr 


Thr 


20 
He 


Val 


His 


Ala 


Ser 


Phe 


He 


Ser 


Cys 


He 


Ser 


Ser 


Ser 






35 










40 










45 










Asp 


Gly 


Gin 


Gly 


Glu 


Arg 


Gin 


Glu 


Gin 


Arg 


Gly 


His 


Phe 


Trp 


Arg 




50 










55 










60 










: ■ r 


Gl n 


Arq 


Leu 


Leu 


Cys 


Glu 


Asp 


Ala 


Trp 


Glu 


Gin 


Glu 


Val 


Gin 


Val 


*n 5 










70 










75 










80 




Leu 


Pro 


Leu 


Leu 


Pro 


Leu 


Leu 


Gin 


Gly 


Ser 


Gly 


Lys 


Ser 


Asn 


Val 










85 










90 










95 






Ala 


Trp 


Gly 


Asp 


Tyr 


Asp 


Asp 


Ser 


Ala 


Phe 


Met 


Asp 


Pro 


Arg 


Tyr 








100 










105 










110 






ills 


Val 


His 


Gly 


Glu 


Asp 


Leu 


Asp 


Lys 


Leu 


His 


Arg 


Ala 


Ala 


Trp 


Trp 






115 










120 










125 








Gly 


Lys 


Val 


?r o 


Arg 


Lys 


Asp 


Leu 


He 


Val 


Met 


Leu 


Arg 


Asp 


Thr 


Asp 




130 










135 










140 










Vaj 


Asn 


Lys 


Arq 


Arp 


Lys 


Gin 


Lys 


Arg 


Thr 


Ala 


Leu 


His 


Leu 


Ala 


Ser 


14 5 




















155 










160 


Ala 


Asn 


Gly 


A^^r 


Ser 


Glu 


Val 


Val 


Lys 


Leu 


Val 


Leu 


Asp 


Arg Arg 


Cys 










lt5 










170 










175 




Gin 


Leu 


A.sn 


Val 


Leu 


Asp 


Asn 


Lys 


Lys 


Arg 


Thr 


Ala 


Leu 


Thr 


Jjys 


Ala 








:eo 










185 










190 






Val 


Gin 


Cys 


Gin 


GH 


Asp 


Glu 


Cys 


Ala 


Leu 


Met 


Leu 


Leu 


Glu 


His 


Gly 






195 










200 










205 








Thr 


Asp 


Pro 


Asn 


He 


Pro 


Asp 


Glu 


Tyr 


Gly 


Asn 


Thr 


Thr 


Leu 


His 


Tyr 




210 










215 










220 










Ala 


Val 


Tyr 


Asn 


Glu 


Asp 


Lys 


Leu 


Met 


Ala 


Lys 


Ala 


Leu 


Leu 


Leu 


Tyr 


225 










230 










235 










240 


Gly 


Ala 


Asp 


He 


Glu 


Ser 


Lys 


Asn 


Lys 


His 


Gly 


Leu 


Thr 


Pro 


Leu 


Leu 










2^5 










250 










255 




Leu 


Gly 


He 


His 


Glu 


Gin 


Lys 


Gin 


Gin 


Val 


Val 


Lys 


Phe 


Leu 


He 


Lys 








260 










265 










270 






l,y.^ 


Lys 


Ala 


Asn 


Leu 


Asn 


Ala 


Leu 


Asp 


Arg 


Tyr 


Gly 


Arg 


Thr 


Ala 


Leu 






275 










280 










285 








He 


Leu 


Ala 


Val 


Cys 


Cys 


Gly 


Ser 


Ala 


Ser 


He 


Val 


Ser 


Pro 


Leu 


Leu 




2 90 










295 










300 










Glu 


Gin 


Asn 


Val 


Asp 


Val 


Ser 


Ser 


Gin 


Asp 


Leu 


Glu 


Arg 


Arg 


Pro 


Glu 


'^05 










310 










315 










320 


Ser 


Met 


Leu 


Phe 


Leu 


Val 


He 


He 


Met 

















325 



<210> 377 
<211> 148 
<212> PRT 
<213> Homo sapien 



<220> 

<221> VARIANT 

<222> (1) . , . (148) 

<223> Xaa = Any Amino Acid 



<400> 377 

Met. Thr Xaa Pro Ser Trp Ser Pro Gly Thr Thr Ser Val Glu Lys He 

1 5 10 15 

Trp Thr Ser Ser Thr Glu Leu Pro Trp Trp Gly Lys Val Pro Ajrg Lys 

20 ■ 25 30 

Asp Leu He Val Met Leu Arg Asp Thr Asp Val Asn Lys Xaa Asp Lys 



^33 A' 



wo 01/51633 PCT/l]S01/01574 

125 



35 

Gin Lys Arg Thr 
50 

Val Val Lys Leu 
65 

Asn Lys Lys Arg 

Glu Cys Ala Leu 
100 

Asp Glu Tyr Gly 

115 

Lys Leu Met Ala 
130 

Lys Asn Lys Val 
145 



40 

Ala Leu His Leu 
55 

Xaa Leu Asp Arg 
70 

Thr Ala Leu Xaa 
85 

Met Leu Leu Glu 

Asn Thr Thr Leu 
120 

Lys Ala Leu Leu 
135 



Ala Ser Ala Asn 
60 

Arg Cys Gin Leu 
75 

Lys Ala Val Gin 
90 

His Gly Thr Asp 
105 

His Tyr Ala Xaa 

Leu Tyr Gly Ala 
140 



45 

Gly Asn Ser Glu 

Asn Val Leu Asp 
80 

Cys Gin Glu Asp 
95 

Pro Asn lie Pro 
110 

Tyr Asn Glu Asp 
125 

Asp lie Glu Ser 



<210> 378 

<211> 1719 

<212> PRT 

<213> Homo sapien 



<400> 378 

Met Val Val Glu Val Asp Ser Met Pro Ala Ala Ser Ser Val Lys Lys 

15 10 15 

Pro Phe Gly Leu Arg Ser Lys Met Gly Lys Trp Cys Cys Arg Cys Phe 

20 25 30 

Pro Cys Cys Arg Glu Ser Gly Lys Ser Asn Val Gly Thr Ser Gly Asp 

35 40 45 

His Asp Asp Ser Ala Met Lys Thr Leu Arg Ser Lys Met Gly Lvs Trp 

50 55 60 

Cys Arg His Cys Phe Pro Cys Cys Arg Gly Ser Gly Lys Ser Asn Val 
^5 70 75 80 

^-^j ^.^^ ^ — rio^^ i.^o ii>.^i^ o^r ria-a wet ijys inr Leu Arg Asn 

85 90 95 

Lys Met Gly Lys Trp Cys Cys His Cys Phe Pro Cys Cys Arg Gly Ser 

100 105 110 

Gly Lys Ser Lys Val Gly Ala Trp Gly Asp Tyr Asp Asp Ser Ala Phe 

115 120 125 

Met Glu Pro Arg Tyr His Val Arg Gly Glu Asp Leu Asp Lys Leu His 

130 135 140 

Arg Ala Ala Trp Trp Gly Lys Val Pro Arg Lys Asp Leu lie Val Met 
14^ 150 155 160 

Leu Arg Asp Thr Asp Val Asn Lys Lys Asp Lys Gin Lys Arg Thr Ala 

165 170 175 

Leu His Leu Ala Ser Ala Asn Gly Asn Ser Glu Val Val Lys Leu Leu 

180 185 190 

Leu Asp Arg Arg Cys Gin Leu Asn Val Leu Asp Asn Lys Lys Arg Thr 

195 200 205 

Ala Leu He Lys Ala Val Gin Cys Gin Glu Asp Glu Cys Ala Leu Met 

210 215 220 

Leu Leu Glu His Gly Thr Asp Pro Asn He Pro Asp Glu Tyr Gly Asn 
225 230 235 240 

Thr Thr Leu His Tyr Ala He Tyr Asn Glu Asp Lys Leu Met Ala Lys 

245 250 255 

Ala Leu Leu Leu Tyr Gly Ala Asp He Glu Ser Lys Asn Lys His Gly 

260 265 270 

Leu Thr Pro Leu Leu Leu Gly Val His Glu Gin Lys Gin Gin Val Val 
275 280 285 
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Phe 


Leu 


He 


Lys 


Lys 




290 










G.ly 


Arg 


Thr 


Ala 


Leu 


He 












310 


V a 1 


Ser 


Leu 


Leu 


Leu 


Glu 










325 




Ser 


Gly 


Gin 


Thr 


Ala 


Arg 








340 






He 


Cys 


Gin 


Leu 


Leu 


Ser 






355 








Ser 


Ser 


Glu 


Asn 


Ser 


Asn 




370 










Pro 


Arg 


Thr 


His 


Met 


Val 


38b 










390 


Ser 


Val 


Lys 


Lys 


Pro 


Phe 










405 




Cys 


Arg 


Cys 


Phe 


Pro 


Cys 








420 






Thr 


Ser 


Gly 


Asp 


His 


Asp 






435 








Met 


Gly 


Lys 


Trp 


Cys 


Arg 




450 










Lys 


Ser 


Asn 


Val 


Gly 


Ala 


A 6b 










470 


Tnr 


Leu 


Arg 


Asn 


Lys 


Met 










485 




Cys 


Arg 


Gly 


Ser 


Gly 


Lys 








500 






Asp 


Ser 


Ala 


Phe 


Met 


Glu 






515 








Asp 


Lys 


Leu 


His 


Arg 


Ala 




530 










Leu 


He 


Val 


Met 


Leu 


Arg 


545 










550 


hys 


Arg 


Thr 


Ala 


Leu 


His 










565 




Va} 


Lys 


Leu 


Leu 


Leu 


Asp 








580 






Lys 


Lys 


Arg 


Thr 


Ala 


Leu 






595 








Cys 


Ala 


Leu 


Met 


Leu 


Leu 




610 












Tyr 


Gly 


Asn 


Thr 


Thr 


525 










630 


Leu 


Met 


Ala 


Lys 


Ala 


Leu 










645 




Asn 


Lys 


His 


Gly 


Leu 


Thr 








660 






Gin 


Gin 


Val 


Val 


Lys 


Phe 






675 








Leu 


Asp 


Arg 


Tyr 


Gly 


Arg 




690 










Ser 


Ala 


Ser 


He 


Val 


Ser 


/Ob 










710 


Ser 


Gin 


Asp 


Leu 


Ser 


Gly 










725 




H i r 


H J s 


His 


Val 


He 





740 



126 



Lys 


Ala 


Asn 


Leu 


Asn 


Ala 


2 95 










300 


Leu 


Ala 


Val 


Cys 


Cys 


Gly 










315 




Gin 


Asn 


He 


Asp 


Val 


Ser 








330 






Glu 


Tyr 


Ala 


Val 


Ser 


Ser 






345 








Asp 


Tyr 


Lys 


Glu 


Lys 


Gin 




360 










Pro 


Glu 


Asn 


Val 


Ser 


Arg 


375 










380 


Val 


Glu 


Val 


Asp 


Ser 


Met 










395 




Gly 


Leu 


Arg 


Ser 


Lys 


Met 








410 






Cys 


Arg 


Glu 


Ser 


Gly 


Lys 






425 








Asp 


Ser 


Ala 


Met 


Lys 


Thr 




440 










His 


Cys 


Phe 


Pro 


Cys 


Cys 


455 










460 


Ser 


Gly 


Asp 


His 


Asp 


Asp 










4 75 




Gly 


Lys 


Trp 


Cys 


Cys 


His 








490 






Ser 


Lys 


Val 


Gly 


Ala 


Trp 






505 








Pro 


Arg 


Tyr 


His 


Val 


Arg 




520 










Ala 


Trp 


Trp 


Gly 


Lys 


Val 


535 










540 


Asp 


Thr 


Asp 


Val 


Asn 


Lys 










555 




Leu 


Ala 


Ser 


Ala 


Asn 


Gly 








570 






Arg 


Arg 


Cys 


Gin 


Leu 


Asn 






585 








He 


Lys 


Ala 


Val 


Gin 


Cys 




600 










Glu 


His 


Gly 


Thr 


Asp 


Pro 


615 










620 


Leu 


His 


Tyr 


Ala 


He 


Tyr 










635 




Leu 


Leu 


Tyr 


Gly 


Ala 


Asp 








650 






Pro 


Leu 


Leu 


Leu 


Gly 


Val 






665 








Leu 


He 


Lys 


Lys 


Lys 


Ala 




680 










Thr 


Ala 


Leu 


He 


Leu 


Ala 


695 










700 


Leu 


Leu 


Leu 


Glu 


Gin 


Asn 










715 




Gin 


Thr 


Ala 


Arg 


Glu 


Tyr 








730 






Gin 


Leu 


Leu 


Ser 


Asp 


Tyr 



745 
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Leu 


Asp 


Arg 


Tyr 


Ser 


Ala 


Ser 


He 








320 


Ser 


Gin 


Asp 


Leu 






335 




His 


His 


His 


Val 




350 






Met 


Leu 


Lys 


He 


365 








Thr 


Arg 


Asn 


Lys 


Pro 


Ala 


Ala 


Ser 








400 


Gly 


Lys 


Trp 


Cys 






415 




Ser 


Asn 


Val 


Gly 




430 






Leu 


Arg 


Ser 


Lys 


445 








Arg 


Gly 


Ser 


Gly 


Ser 


Ala 


Met 


Lys 








480 


Cys 


Phe 


Pro 


Cys 






495 




Gly 


Asp 


Tyr 


Asp 




510 






Gly 


Glu 


Asp 


Leu 


525 








Pro 


Arg 


Lys 


Asp 


Lys 


Asp 


Lys 


Gin 








560 


Asn 


Ser 


Glu 


Val 






575 




Val 


Leu 


Asp 


Asn 




590 






Gin 


Glu 


Asp 


Glu 


605 








Asn 


He 


Pro 


Asp 


Asn 


Glu 


Asp 


Lys 








64 0 


He 


Glu 


Ser 


Lys 






655 




His 


Glu 


Gin 


Lys 




670 






Asn 


Leu 


Asn 


Ala 


685 








Val 


Cys 


Cys 


Gly 


He 


Asp 


Val 


Ser 








720 


Ala 


Val 


Ser 


Ser 






735 




Lys 


Glu 


Lys 


Gin 




750 
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Met Leu Lys lie Ser Ser Glu Asn Ser Asn Pro Glu Gin Asp Leu Lys 

760 765 
Leu Thr Ser Glu Glu Glu Ser Gin Arg Phe Lys Gly Ser Glu Asn Ser 

'^'^^ 775 780 

Gin Pro Glu Lys Met Ser Gin Glu Pro Glu He Asn Lys Asp Gly Asp 
785 790 795 800 

Arg Glu Val Glu Glu Glu Met Lys Lys His Glu Ser Asn Asn Val Gly 

805 810 815 

Leu Leu Glu Asn Leu Thr Asn Gly Val Thr Ala Gly Asn Gly Asp Asn 

620 825 830 

Gly Leu He Pro Gin Arg Lys Ser Arg Thr Pro Glu Asn Gin Gin Phe 

835 840 845 

Pro Asp Asn Glu Ser Glu Glu Tyr His Arg He Cys Glu Leu Val Ser 

850 855 860 

Asp Tyr Lys Glu Lys Gin Met Pro Lys Tyr Ser Ser Glu Asn Ser Asn 
865 870 875 880 

Pro Glu Gin Asp Leu Lys Leu Thr Ser Glu Glu Glu Ser Gin Arg Leu 

865 890 895 

Glu Gly Ser Giu Asn Gly Gin Pro Glu Leu Glu Asn Phe Met Ala He 

^00 905 
Glu Glu Met Lys Lys His Gly Ser Thr His Val Gly Phe Pro Glu Asn 

920 925 
Leu Thr Asn Gly Ala Thr Ala Gly Asn Gly Asp Asp Gly Leu He Pro 

5^30 935 940 

Pro Arg Lys Ser Arg Thr Pro Glu Ser Gin Gin Phe Pro Asp Thr Glu 

950 955 960 

Asn Glu Glu Tyr His Ser Asp Glu Gin Asn Asp Thr Gin Lys Gin Phe 

S*65 970 975 

Cys Glu Glu Gin Asn Thr Gly He Leu His Asp Glu He Leu He His 

980 985 990 

Glu Glu Lys Gin He Glu Val Val Glu Lys Met Asn Ser Glu Leu Ser 

^95 1000 1005 

Leu Ser Cvs Lvs Lvs Glu T,\/c; A«o ti^^ T.#:.n h-j mi-. 7\^^ o^^ rnx.^ t 

1010 1015 1020 

Arg Glu Glu He Ala Met Leu Arg Leu Glu Leu Asp Thr Met Lys His 
1025 1030 1035 1040 

Gin Ser Gin Leu Pro Arg Thr His Met Val Val Glu Val Asp Ser Met 

10^5 1050 1055 

Pro Ala Ala Ser Ser Val Lys Lys Pro Phe Gly Leu Arg Ser Lys Met 

1060 1065 1070 

Gly Lys Trp Cys Cys Arg Cys Phe Pro Cys Cys Arg Glu Ser Gly Lys 

1075 1080 1085 

Ser Asn Val Gly Thr Ser Gly Asp His Asp Asp Ser Ala Met Lys Thr 

1090 1095 1100 

Leu Arg Ser Lys Met Gly Lys Trp Cys Arg His Cys Phe Pro Cys Cys 
1105 1110 1115 j^^20 

Arg Gly Ser Gly Lys Ser Asn Val Gly Ala Ser Gly Asp His Asp Asp 

1125 1130 1135 

Ser Ala Met Lys Thr Leu Arg Asn Lys Met Gly Lys Trp Cys Cys His 

11^0 1145 1150 

Cys Phe Pro Cys Cys Arg Gly Ser Gly Lys Ser Lys Val Gly Ala Trp 

1155 1160 ' 1165 

Gly Asp Tyr Asp Asp Ser Ala Phe Met Glu Pro Arg Tyr His Val Arg 

1170 1175 1180 

Gly Glu Asp Leu Asp Lys Leu His Arg Ala Ala Trp Trp Gly Lys Val 
1185 1190 1195 1200 

Pro Arg Lys Asp Leu He Val Met Leu Arg Asp Thr Asp Val Asn Lys 
1205 1210 1215 
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L/^. Asp Lys Gin Lys Arg Thr Ala Leu His Leu Ala Ser Ala Asn Gly 

1220 1225 1230 

A-;r 2er Glu Val Val I^ys Leu Leu Leu Asp A.rg Arg Cys Gin Leu Asn 

1235 1240 1245 

Val Leu Asp Asn Lys Lys Arg Thr Ala Leu lie Lys Ala Val Gin Cys 

1250 ' 1255 1260 

'^In Glu Asp Glu Cys Ala Leu I^et Leu Leu Glu His Gly Thr Asp Pro 
j2C5 1270 1275 " 1280 

Asn lie Pro Asp Glu Tyr Gly Asn Thr Thr Leu His Tyr Ala lie Tyr 

1285 1290 1295 

Asn Glu Asp Lys Leu Met Ala Lys Ala Leu Leu Leu Tyr Gly Ala Asp 

1300 ^ 1305 1310 

lie Glu Ser Lys Asn Lys His Gly Leu Thr Pro Leu Leu Leu Gly Val 

1315 1320 1325 

His Glu Gin Lys Gin Gin Val Val Lys Phe Leu lie Lys Lys Lys Ala 

1330 1335 1340 

Asn Leu A.sn Ala Leu Asp Arg Tyr Gly Arg Thr Ala Leu lie Leu Ala 
1345 1350 1355 1360 

Val Cys Cys Gly Ser Ala Ser He Val Ser Leu Leu Leu Glu Gin Asn 

1365 1370 1375 

He Asp Val Ser Ser Gin Asp Leu Ser Gly Gin Thr Ala Arg Glu Tyr 

1380 1385 1390 

Axa Val Ser Ser His His His Val He Cys Gin Leu Leu Ser Asp Tyr 

1395 1400 1405 

Lys Glu Lys Gin Met Leu Lys He Ser Ser Glu Asn Ser Asn Pro Glu 

1410 1415 1420 

Gin Asp Leu Lys Leu Thr Ser Glu Glu Glu Ser Gin Arg Phe Lys Gly 
1425 1430 1435 1440 

Ser Glu Asn Ser Gin Pro Glu Lys Met Ser Gin Glu Pro Glu He Asn 

1445 1450 1455 

Lys Asp Gly Asp Arg Glu Val Glu Glu Glu Met Lys Lys His Glu Ser 

1460 1465 1470 

.-.sn Asn Val Gly Leu Leu Glu Asn Leu Thr Asn Gly Val Thr Ala Gly 

1475 1480 1485 

Asr; Gly Asp Asn Gly Leu He Pro Gin Arg Lys Ser Arg Thr Pro Glu 

1490 1495 1500 

Asn Gin Gin Phe Pro Asp Asn Glu Ser Glu Glu Tyr His Arg He Cys 
1505 1510 1515 1520 

Glu Leu Val Ser Asp Tyr Lys Glu Lys Gin Met Pro Lys Tyr Ser Ser 

1525 1530 1535 

Glu Asn Ser Asn Pro Glu Gin Asp Leu Lys Leu Thr Ser Glu Glu Glu 

1540 1545 1550 

:->er Gin Arg Leu Glu Gly Ser Glu Asn Gly Gin Pro Glu Lys Arg Ser 

1555 1560 1565 

C.i.n Glu Pro Glu He Asn Lys Asp Gly Asp Arg Glu Leu Glu Asn Phe 

1570 1575 1580 

Met A.la He Glu Glu Met Lys Lys His Gly Ser Thr His Val Gly Phe 
1585 1590 1595 1600 

Pro Glu Asn Leu Thr Asn Gly Ala Thr Ala Gly Asn Gly Asp Asp Gly 

1605 1610 1615 

Leu He Pro Pro Arg Lys Ser Arg Thr Pro Glu Ser Gin Gin Phe Pro 

1620 1625 1630 

Asp Thr Glu Asn Glu Glu Tyr His Ser Asp Glu Gin Asn Asp Thr Gin 

1635 1640 1645 

Lys Gin Phe Cys Glu Glu Gin Asn Thr Gly He Leu His Asp Glu He 

1650 1655 1660 

Leu He His Glu Glu Lys Gin He Glu Val Val Glu Lys Met Asn Ser 
:i665 1670 1675 1680 
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Glu Leu Ser Leu Ser Cys Lys Lys Glu Lys Asp lie Leu His Glu Asn 

1685 1690 1695 

Ser Thr Leu Arg Glu Glu lie Ala Met Leu Arg Leu Glu Leu Asp Thr 

1700 1705 1710 

Met Lys His Gin Ser Gin Leu 
1715 

<210> 379 
<211> 656 
<212> PRT 
<213> Homo sapien 

<400> 379 

Met Val Val Glu Val Asp Ser Met Pro Ala Ala Ser Ser Val Lys Lys 

^5 10 15 

Pro Phe Gly Leu Arg Ser Lys Met Gly Lys Trp Cys Cys Arg Cys Phe 

20 25 30 

Pro Cys Cys Arg Glu Ser Gly Lys Ser Asn Val Gly Thr Ser Gly Asp 

35 40 45 

His Asp Asp Ser Ala Met Lys Thr Leu Arg Ser Lys Met Glv Lvs Tro 

50 55 60 

Cys Arg His Cys Phe Pro Cys Cys Arg Gly Ser Gly Lys Ser Asn Val 
^5 70 75 80 

Gly Ala Ser Gly Asp His Asp Asp Ser Ala Met Lys Thr Leu Arg Asn 

85 90 95 

Lys Met Gly Lys Trp Cys Cys His Cys Phe Pro Cys Cys Arg Gly Ser 

100 105 110 

Gly Lys Ser Lys Val Gly Ala Trp Gly Asp Tyr Asp Asp Ser Ala Phe 

115 120 125 

Met Glu Pro Arg Tyr His Val Arg Gly Glu Asp Leu Asp Lys Leu His 

130 135 140 

Arg Ala Ala Trp Trp Gly Lys Val Pro Arg Lys Asp Leu lie Val Met 

TCn -ir-r- 

-i-o^ -LOO IbO 

Leu Arg Asp Thr Asp Val Asn Lys Lys Asp Lys Gin Lys Arg Thr Ala 

165 170 175 

Leu His Leu Ala Ser Ala Asn Gly Asn Ser Glu Val Val Lys Leu Leu 

IBO 185 190 

Leu Asp Arg Arg Cys Gin Leu Asn Val Leu Asp Asn Lys Lys Arg Thr 

195 200 205 

Ala Leu He Lys Ala Val Gin Cys Gin Glu Asp Glu Cys Ala Leu Met 

210 215 220 

Leu Leu Glu His Gly Thr Asp Pro Asn He Pro Asp Glu Tyr Gly Asn 
225 230 235 240 

Thr Thr Leu His Tyr Ala He Tyr Asn Glu Asp Lys Leu Met Ala Lys 

245 250 255 

Ala Leu Leu Leu Tyr Gly Ala Asp He Glu Ser Lys Asn Lys His Gly 

260 265 270 

Leu Thr Pro Leu Leu Leu Gly Val His Glu Gin Lys Gin Gin Val Val 

275 280 285 

Lys Phe Leu He Lys Lys Lys Ala Asn Leu Asn Ala Leu Asp Arq Tvr 

290 295 300 

Gly Arg Thr Ala Leu He Leu Ala Val Cys Cys Gly Ser Ala Ser He 
305 310 315 320 

Val Ser Leu Leu Leu Glu Gin Asn He Asp Val Ser Ser Gin Asp Leu 

325 330 335 

Ser Gly Gin Thr Ala Arg Glu Tyr Ala Val Ser Ser His His His Val 

340 345 350 

He Cys Gin Leu Leu Ser Asp Tyr Lys Glu Lys Gin Met Leu Lys He 
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355 360 365 

Ser Ser Glu Asn Ser Asn Pro Glu Gin Asp Leu Lys Leu Thr Ser Giu 

370 375 380 

Glu Giu Ser Gin Arg Phe Lys Gly Ser Glu A.sn Ser Gin Pro Glu Lys 
385 390 395 400 

Met Ser Gin Glu Pro Glu lie Asn Lys Asp Gly Asp Arg Glu Val Glu 

405 410 415 

Glu Glu Met Lys Lys His Giu Ser Asn Asn Val Gly Leu Leu Glu Asn 

420 425 430 

Leu Thr A.sn Gly Val Thr Ala Gly Asn Gly Asp Asn Gly Leu lie Pro 

435 440 445 

G3.n Arg Lys Ser Arg Thr Pro Glu Asn Gin Gin Phe Pro Asp Asn Giu 

450 455 460 

Ser Giu Glu Tyr His Arg lie Cys Giu Leu Val Ser Asp Tyr Lys Glu 
465 470 475 480 

Lys Gin Met Pro Lys Tyr Ser Ser Glu Asn Ser Asn Pro Giu Gin Asp 

485 490 495 

Leu Lys Leu Thr Ser Giu Glu Glu Ser Gin Arg Leu Giu Gly Ser Glu 

500 505 510 

Asn Gly Gin Pro Glu Leu Glu Asn Phe Met Ala lie Giu Glu Met Lys 

515 520 525 

Lys His Giy Ser Thr His Val Gly Phe Pro Glu Asn Leu Thr Asn Gly 

530 535 540 

Ala Thr Ala Giy Asn Gly Asp Asp Giy Leu lie Pro Pro Arg Lys Ser 
^"^45 550 555 560 

Arg Thr Pro Glu Ser Gin Gin Phe Pro Asp Thr Glu Asn Glu Glu Tyr 

565 570 575 

His Ser Asp Glu Gin Asn Asp Thr Gin Lys Gin Phe Cys Glu Glu Gin 

580 585 590 

Asn Thr Giy lie Leu His Asp Glu lie Leu He His Giu Glu Lys Gin 

595 600 605 

Tie Glu Val Val Glu Lys Met Asn Ser Glu Leu Ser Leu Ser Cys Lys 

610 615 620 

Lys Glu Lys Asp He Leu His Glu Asn Ser Thr Leu Arg Glu Glu lie 
625 630 635 640 

Ala Met Leu Arg Leu Glu Leu Asp Thr Met Lys His Gin Ser Gin Leu 
645 650 655 



<210> 380 

<211> 671 

<212> PRT 

<213> Homo sapien 



<400> 380 



Met 


Val 


Val 


Glu 


Val 


Asp 


Ser 


Met 


Pro 


Ala 


Ala 


Ser 


Ser 


Val 


Lys 


Lys 


1 








5 










10 










15 


Pro 


Phe 


Gly 


Leu 
20 


Arg 


Ser 


Lys 


Met 


Gly 
25 


Lys 


Trp 


Cys 


Cys 


Arg 

30 


Cys 


Phe 


Pro 


Cys 


Cys 


Arg 


Glu 


Ser 


Gly 


Lys 


Ser 


Asn 


Val 


Gly 


Thr 


Ser 


Gly 


Asp 






35 










40 










45 






His 


Asp 
50 


Asp 


Ser 


Ala 


Met 


Lys 
55 


Thr 


Leu 


Arg 


Ser 


Lys 
60 


Met 


Giy 


Lys 


Trp 


Cys 


Arg 


His 


Cys 


Phe 


Pro 


Cys 


Cys 


Arg 


Gly 


Ser 


Giy 


Lys 


Ser 


Asn 


Val 


65 










70 










75 










80 


Gly 


Ala 


Ser 


Gly 


Asp 

85 


His 


Asp 


Asp 


Ser 


Ala 
90 


Met 


Lys 


Thr 


Leu 


Arg 
95 


Asn 


Lys 


Met 


Giy 


Lys 


Trp 


Cys 


Cys 


His 


Cys 


Phe 


Pro 


Cys 


Cys 


Arg 


Gly 


Ser 



100 105 110 
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Gly Lys Ser Lys 
115 

Met Glu Pro Arg 

130 

Arg Ala Ala Trp 
145 

Leu Arg Asp Thr 

Leu His Leu Ala 
180 

Leu Asp Arg Arg 
195 

Ala Leu He Lys 
210 

Leu Leu Glu His 
225 

Thr Thr Leu His 

Ala Leu Leu Leu 
260 

Leu Thr Pro Leu 
275 

Lys Phe Leu He 
290 

Gly Arg Thr Ala 
305 

Val Ser Leu Leu 

Ser Gly Gin Thr 
340 

He Cys Gin Leu 
355 

Ser Ser Glu Asn 
370 

Glu Glu Ser Gin 
385 

Met Ser Gin Glu 

Glu Glu Met Lys 
420 

Leu Thr Asn Gly 
435 

Gin Arg Lys Ser 
450 

Ser Glu Glu Tyr 
465 

Lys Gin Met Pro 

Leu Lys Leu Thr 
500 

Asn Gly Gin Pro 
515 

Gly Asp Arg Glu 
530 

His Gly Ser Thr 
545 

Thr Ala Gly Asn 



Val Gly Ala Trp 
120 

Tyr His Val Arg 
135 

Trp Gly Lys Val 
150 

Asp Val Asn Lys 
165 

Ser Ala Asn Gly 

Cys Gin Leu Asn 
200 

Ala Val Gin Cys 
215 

Gly Thr Asp Pro 

230 

Tyr Ala He Tyr 
245 

Tyr Gly Ala Asp 

Leu Leu Gly Val 
280 

Lys Lys Lys Ala 
295 

Leu He Leu Ala 
310 

Leu Glu Gin Asn 
325 

Ala Arg Glu Tyr 

Leu Ser Asp Tyr 
360 

Ser Asn Pro Glu 
375 

TVrg Phe Lys Gly 
390 

Pro Glu He Asn 
405 

Lys His Glu Ser 

Val Thr Ala Gly 
440 

Arg Thr Pro Glu 
455 

His Arg He Cys 
470 

Lys Tyr Ser Ser 
485 

Ser Glu Glu Glu 

Glu Lys Arg Ser 
520 

Leu Glu Asn Phe 
535 

His Val Gly Phe 
550 

Gly Asp Asp Gly 
565 



Gly Asp Tyr Asp 

Gly Glu Asp Leu 
140 

Pro Arg Lys Asp 
155 

Lys Asp Lys Gin 
170 

Asn Ser Glu Val 
185 

Val Leu Asp Asn 

Gin Glu Asp Glu 
220 

Asn He Pro Asp 
235 

Asn Glu Asp Lys 
250 

He Glu Ser Lys 
265 

His Glu Gin Lys 

Asn Leu Asn Ala 
300 

Val Cys Cys Gly 
315 

He Asp Val Ser 
330 

Ala Val Ser Ser 
345 

Lys Glu Lys Gin 

Gin Asp Leu Lys 
380 

Ser Glu Asn Ser 
395 

Lys Asp Gly Asp 
410 

Asn Asn Val Gly 
425 

Asn Gly Asp Asn 

Asn Gin Gin Phe 
460 

Glu Leu Val Ser 
475 

Glu Asn Ser Asn 
490 

Ser Gin Arg Leu 
505 

Gin Glu Pro Glu 

Met Ala He Glu 
540 

Pro Glu Asn Leu 
555 

Leu He Pro Pro 
570 



Asp Ser Ala Phe 
125 

Asp Lys Leu His 

Leu He Val Met 
160 

Lys Arg Thr Ala 
175 

Val Lys Leu Leu 
190 

Lys Lys Arg Thr 
205 

Cys Ala Leu Met 

Glu Tyr Gly Asn 
240 

Leu Met Ala Lys 
255 

Asn Lys His Gly 
270 

Gin Gin Val Val 
285 

Leu Asp Arg Tyr 

Ser Ala Ser He 
320 

Ser Gin Asp Leu 
335 

His His His Val 
350 

Met Leu Lys He 
365 

Leu Thr Ser Glu 

Gin Pro Glu Lys 
400 

Arg Glu Val Glu 
415 

Leu Leu Glu Asn 
430 

Gly Leu He Pro 
445 

Pro Asp Asn Glu 

Asp Tyr Lys Glu 
480 

Pro Glu Gin Asp 
495 

Glu Gly Ser Glu 
510 

He Asn Lys Asp 
525 

Glu Met Lys Lys 

Thr Asn Gly Ala 
560 

Arg Lys Ser Arg 

575 
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Thr 


Pro 


Glu 


Ser 
580 


Gin 


Gin 


Phe 


Pro 


Asp 

585 


Thr 


Glu 


Asn 


Glu 


Glu 
590 


Tyr 


Kis 


Se 


Asp 


Glu 
5 95 


Gin 


Asn 


Asp 


Thr 


Gin 
600 


Lys 


Gin 


Phe 


Cys 


Glu 
605 


Glu 


Gin 


Asn 


Thr 


Gly 
610 


lie 


Leu 


His 


Asp 


Glu 
615 


He 


Leu 


lie 


His 


Glu 
620 


Glu 


Lys 


Gin 


He 


Glu 


Val 


Val 


Glu 


Lys 


Met 


Asn 


Ser 


Glu 


Leu 


Ser 


Leu 


Ser 


Cys 


Ijys 


Lys 


62 b 










630 










635 










640 


Glu 


Lys 


Asp 


He 


Leu 
645 


His 


Glu 


Asn 


Ser 


Thr 
650 


Leu 


Arg 


Glu 


Glu 


He 
655 


Ala 


Mot 


Leu 


Arg 


Leu 
660 


Glu 


Leu 


Asp 


Thr 


Met 
665 


Lys 


His 


Gin 


Ser 


Gin 
670 


Leu 





<210> 381 
<211> 251 
<212> DNA 
<213> Homo sapien 



<400> 381 

ggagaagcgt ctgctggggc aggaaggggt ttccctgccc tctcacctgt ccctcaccaa 60 

ggtaacatgc ttcccctaag ggtatcccaa cccaggggcc tcaccatgac ctctgagggg 120 

ccaatatccc aggagaagca ttggggagtt gggggcaggt gaaggaccca ggactcacac 180 

atcctgggcc tccaaggcag aggagagggt cctcaagaag gtcaggagga aaatccgtaa 240 

caagcagtca g 251 



<210> 382 

<211> 3279 

<212> DNA 

<213> Homo sapiens 

<400> 382 

cttcctgcag cccccatgct ggtgaggggc acgggcagga acagtggacc caacatggaa 60 
cjtgctggagg gtgtcaggaa gtgatcgggc tctggggcag ggaggagggg tggggagtgt 120 
cactgggagg ggacatcctg cagaaggtag gagtgagcaa acacccgctg caggggaggg 180 
gagagccctg cggcacctgg gggagcagag ggagcagcac ctgcccaggc ctgggaggag 240 
gggcctggag ggcgtgagga ggagcgaggg ggctgcatgg ctggagtgag ggatcagggg 300 
cagggcgcga gatggcctca cacagggaag agagggcccc tcctgcaggg cctcacctgg 360 
gccacaggag gacactgctt ttcctctgag gagtcaggag ctgtggatgg tgctggacag 420 
aagaaggaca gggcctggct caggtgtcca gaggctgtcg ctggcttccc tttgggatca 480 
gactgcaggg agggagggcg gcagggttgt ggggggagtg acgatgagga tgacctgggg 540 
gtggctccag gccttgccc.c tgcctgggcc ctcacccagc ctccctcaca gtctcctggc 600 
cctcagtctc tcccctccac tccatcctcc atctggcctc agtgggtcat tctgatcact 660 
qaactgacca tacccagccc tgcccacggc cctccatggc tccccaatgc cctggagagg 720 
ggacatctag tcagagagta gtcctgaaga ggtggcctct gcgatgtgcc tgtgggggca 780 
gcatcctgca gatggtcccg gccctcatcc tgctgacctg tctgcaggga ctgtcctcct 840 
ggaccttgcc ccttgtgcag gagctggacc ctgaagtccc ctccccatag gccaagactg 900 
gagccttgtt ccctctgttg gactccctgc ccatattctt gtgggagtgg gttctggaga 960 
catttctgtc tgttcctgag agctgggaat tgctctcagt catctgcctg cgcggttctg 1020 
agagatggag ttgcctaggc agttattggg gccaatcttt ctcactgtgt ctctcctcct 1080 
ttacccttag ggtgattctg ggggtccact tgtctgtaat ggtgtgcttc aaggtatcac 1140 
atcatggggc cctgagccat gtgccctgcc tgaaaagcct gctgtgtaca ccaaggtggt 1200 
gcattaccgg aagtggatca aggacaccat cgcagccaac ccctgagtgc ccctgtccca 1260 
cccctacctc tagtaaattt aagtccacct cacgttctgg catcacttgg cctttctgga 1320 
tgctggacac ctgaagcttg gaactcacct ggccgaagct cgagcctcct gagtcctact 1380 
gacctgtgct ttctggtgtg gagtccaggg ctgctaggaa aaggaatggg cagacacagg 1440 
tgtatgccaa tgtttctgaa atgggtataa tttcgtcctc tccttcggaa cactggctgt 1500 
ctctgaagac ttctcgctca gtttcagtga gqacacacac aaagacgtgg gtgaccatgt 1560 
tgtttgtggg gtgcagagat gggaggggtg gggcccaccc tggaagagtg gacagtgaca 1620 



wo 01/51633 



PCI/ II SO 1/0 15 74 



133 



caaggtggac actctctaca gatcactgag gataagctgg agccacaatg catgaggcac 1680 
acacacagca aggttgacgc tgtaaacata gcccacgctg tcctgggggc actgggaagc 1740 
ctagataagg ccgtgagcag aaagaagggg aggatcctcc tatgttgttg aaggagggac 1800 
tagggggaga aactgaaagc tgattaatta caggaggttt gttcaggtcc cccaaaccac 1860 
cgtcagattt gatgatttcc tagcaggact tacagaaata aagagctatc atgctgtggt 1920 
ttattatggt ttgttacatt gataggatac atactgaaat cagcaaacaa aacagatgta 1980 
tagattagag tgtggagaaa acagaggaaa acttgcagtt acgaagactg gcaacttggc 2040 
tttactaagt tttcagactg gcaggaagtc aaacctatta ggctgaggac cttgtggagt 2100 
gtagctgatc cagctgatag aggaactagc caggtggggg cctttccctt tggatggggg 2160 
gcatatccga cagttattct ctccaagtgg agacttacgg acagcatata attctccctg 2220 
caaggatgta tgataatatg tacaaagtaa ttccaactga ggaagctcac ctgatcctta 2280 
gtgtccaggg tttttactgg gggtctgtag gacgagtatg gagtacttga ataattgacc 2340 
tgaagtcctc agacctgagg ttccctagag ttcaaacaga tacagcatgg tccagagtcc 2400 
cagatgtaca aaaacaggga ttcatcacaa atcccatctt tagcatgaag ggtctggcat 24 60 
ggcccaaggc cccaagtata tcaaggcact tgggcagaac atgccaagga atcaaatgtc 2520 
atctcccagg agttattcaa gggtgagccc tttacttggg atgtacaggc tttgagcagt 2580 
gcagggctgc tgagtcaacc ttttattgta caggggatga gggaaaggga gaggatgagg 2640 
aagcccccct ggggatttgg tttggtcttg tgatcaggtg gtctatgggg ctatccctac 2700 
aaagaagaat ccagaaatag gggcacattg aggaatgata ctgagcccaa agagcattca 2760 
atcattgttt tatttgcctt cttttcacac cattggtgag ggagggatta ccaccctggg 2820 
gttatgaaga tggttgaaca ccccac^cat agcaccggag atatgagatc aacagtttct 2880 
tagccataga gattcacagc ccagagcagg aggacgctgc acaccatgca ggatgacatg 294 0 
ggggatgcgc tcgggattgg tgtgaagaag caaggactgt tagaggcagg ctttatagta 3000 
acaagacggt ggggcaaact ctgatttccg tgggggaatg tcatggtctt gctttactaa 3060 
gttttgagac tggcaggtag tgaaactcat taggctgaga accttgtgga atgcagctga 3120 
cccagctgat agaggaagta gccaggtggg agcctttccc agtgggtgtg ggacatatct 3180 
ggcaagattt tgtggcactc ctggttacag atactggggc agcaaataaa actgaatctt 3240 
gttttcagac cttaaaaaaa aaaaaaaaaa aaaagtttt 3279 

<210> 383 
<211> 154 
<212> PRT 

<400> 383 

Met Ala Gly Val Arg Asp Gin 

5 

Gly Lys Arg Gly Pro Leu Leu 
20 

.His Cys Phe Ser Ser Glu Glu 
35 

Lys Lys Asp Arg Ala Trp Leu 
50 55 

Pro Leu Gly Ser Asp Cys Arg 
65 70 

Ser Asp Asp Glu Asp Asp Leu 
85 

Trp Ala Leu Thr Gin Pro Pro 
100 

Pro Ser Thr Pro Ser Ser lie 
115 



Gly Gin Gly Ala Arg Trp Pro His Thr 
10 15 

Gin Gly Leu Thr Trp Ala Thr Gly Gly 
25 30 

Ser Gly Ala Val Asp Gly Ala Gly Gin 
40 45 

Arg Cys Pro Glu Ala Val Ala Gly Phe 
60 

Glu Gly Gly Arg Gin Gly Cys Gly Gly 
75 80 

Gly Val Ala Pro Gly Leu Ala Pro Ala 
90 95 

Ser Gin Ser Pro Gly Pro Gin Ser Leu 
105 110 



Trp Pro Gin Trp Val He Leu He Thr 
120 125 
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Glu Leu Thr He Pro Ser Pro Ala His Gly Pro Pro Trp Leu Pro Asn 
130 135 140 

Ala Leu Glu Arg Gly His Leu Val Arg Glu 



<210> 384 

<?.]!> 557 

\ -V- DNA 

■:r:13> Homo sapiens 



<400> 384 

ggatcctct a 

aaagatgtgt 

ggggaagggt 

uctgcctcct 

acttaacctt 

c:tctgtagag 

tccccaagac 

ccttcttatt 

Icaattgtga 

aaaaaaaaaa 



gagcggccgc 
tttgttttgg 
ccct tttgca 
ggccaagcag 
gaaatggaaa 
agcagcattc 
acatcctaaa 
tatgtgaaca 
aaatgaatat 
aaaaaaa 



ctactactac 
act ctctgtg 
ttgccaagtg 
gctggtttgc 
gtcttgcaat 
ccagggacct 
aggtgttgta 
actgtttgtc 
catgcaaata 



taaatt cgcg 
gtcccttcca 
ccataaccat 
aagaatgaaa 
cccatttgca 
tggaaacagt 
atggtgaaaa 
tttttttgta 
aat tatgcga 



gccgcgtcga 
atgctgtggg 
gagcactact 
tgaatgattc 
ggatccgtct 
tggcactgt a 
cgtcttcctt 
tcttttttaa 
tttttttttc 



cgaaga 
tttcca 
ctacca 
tacagc 
gtgcac 
aggtgc 
ctttat 
actgta 
aaagta 



agag 
acca 
tggt 
tagg 
atgc 
ttgc 
tgcc 
aagt 
aaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

557 



<210> 385 
<211> 337 
<212> DNA 
<213> Homo sapiens 

<400> 385 

ttcccaggtg atgtgcgagg gaagacacat ttactatcct tgatggggct gattccttta 60 

qtttctctag cagcagatgg gttaggagga agtgacccaa gtggttgact cctatgtgca 120 

^"v^tcaaagcc atctgctgtc ttcgagtacg gacacatcat cactcctgca ttgttgatca 180 

aaacgtggag gtgcttttcc tcagctaaga agcccttagc aaaagctcga atagacttag 240 

tatcagacag gtccagtttc cgcaccaaca cctgctggtt ccctgtcgtg gtctggatct 300 

L.'tttggccac caattccccc ttttccacat cccggca 337 



<210> 386 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 386 

gqgcccgcta ccggcccagg ccccgcctcg cgagtcctcc tccccgggtg cctgcccgca 60 
gcccgctcgg cccagagggt gggcgcgggg ctgcctctac cggctggcgg ctgtaactca 120 
gcgaccttgg cccgaaggct ctagcaagga cccaccgacc ccagccgcgg cggcggcggc 180 
gcggactttg cccggtgtgt ggggcggagc ggactgcgtg tccgcggacg ggcagcgaag 24 0 
atgttagcct tcgctgccag gaccgtggac cgatcccagg gctgtggtgt aacctcagcc 300 

<210> 387 
<211> 537 
-'212> DNA 

''.:*13> Homo sapiens 
• ■;00> 387 

'qgccgagtc gggcaccaag ggactctttg caggcttcct tcctcggatc atcaaggctg 60 
ccccctcctg tgccatcatg atcagcacct atgagttcgg caaaagcttc ttccagaggc 120 
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tgaaccagga 
ccacggatgg 
gagggggctt 
gcggcccagc 
cttacccacc 
gtttgctgta 
ctgacccttg 



ccqgcttctg 
ggagagggca 
gtttcccttc 
acttcctcag 
ccccaagttc 
gctgggcatg 
ttaattcctt 



ggcggctgaa 
ggaggagacc 
cctcccggcg 
acacaacttc 
aagaccaaat 
tctccaggaa 
aagtctaaag 



aggggcaagg 
cagccaagtg 
acaagctcca 
ttcctgctgc 
cttccagctg 
ccaagaagcc 
atgatgaact 



aggcaaggac 
ccttttcctc 
gggcagggct 
tccagtcgtg 
cccccttcgt 
ctcagcctgg 
tcaaaaaaaa 



cccgtctctc 180 
agcactgagg 24 0 
gtccctctgg 300 
gggatcatca 360 
gtttccctgt 420 
tgtagtctcc 480 
aaaaaaa 537 



<210> 388 
<211> 520 
<212> DNA 

<213> Homo sapiens 



<400> 388 

aggataattt 

tgaggttaaa 

gtttgaagat 

ggaccccctc 

ccaggaaact 

acttccccca 

tcatactcaa 

atctttcctc 

atgaacttgt 



ttaaaccaat 
ccagtttgca 
tgcctcttct 
cccaacatgc 
gctacttgtg 
ccccagaaga 
ttgatggtta 
ttctcattac 
cttattttaa 



caaatgaaaa 
ttcccctaat 
acagcttctg 
cccagcccac 
gacctcacca 
ttagcatccc 
ttagacaatt 
cagtaaaggc 
tggtgggttt 



aaacaaacaa 
gtggaaaaag 
agaattgtgt 
ccctaagcat 
gagaccagga 
atactagact 
ccatttcttt 
tcttggtatc 
tttttctggt 



acaaaaaagg 
taagaggact 
tatttcactt 
ggtcccttgt 
gggtttggtt 
catactcaac 
ctggttatta 
tttctgttgg 



aaatgtcatg 
actcagcact 
gccaagtgaa 
caccaggcaa 
agctcacagg 
tcaactaggc 
taaacagaaa 
aatgatttct 



60 

120 

180 

240 

300 

360 

420 

480 

520 



<210> 389 
<211> 365 
<212> DNA 
<213> Hoiuo sapiens 



<400> 389 

cgttgcccca 

gagttaaggc 

aacgactttc 

aagcctatgg 

cccaggaaac 

tgagggtcag 

gggag 



gtttgacaga 
tggatttcag 
caaataat ct 
ccagctgtct 
cttcagacta 
tggaagaacc 



aggaaaggcg 
atctgcctgg 
ca ccagcgcc 
ttgtgttccc 
ccttcctctg 
tagactccca 



gagcttattc 
ttccagccgc 
tt ccagct ca 
tctcacccgc 
ccttcagcaa 
ttgctagagg 



aaagtctaga 
agtgtgccct 

ctgtcctcac 

ggggcgttgc 

tagaaagggg 



gggagtggag 60 
ctgctccccc 120 

agctgagact 240 
ccacattctc 300 
aagggtgctg 3 60 
365 



<210> 390 
<211> 221 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> (1) . . . (221) 
<223> n - A,T,C or G 



<400> 390 

tgcctctcca tcctggcccc gacttctctg 
tacacggntt ctcatgggtg tggaacatct 
gctctangag tctgancnga ntcgttgccc 
tcaaagtcta gagggagtgg aggagttaag 

<210> 391 
<211> 325 
<212> DNA 

<213> Homo sapiens 



tcaggaaagt ggggatggac cccatctgca 60 

ctgcttgcgg tttcaggaag gcctctggct 120 

cantntgaca naaggaaagg cggagcttat 180 
gctggatttc a 221 
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I > IT. a s c__ I e Zi t u r e 
■■:.^--^> (1: . . . {325; 
<22 3> n - A,T,C or G 

<ACjQ> 391 

tocj.j jcaggt cccgaggcct ccctagagcc tggggccgac tctgtgncga tgcangcttt 60 

ctcLcqcgcc cagcctggag ctgctcctgg catctaccaa caatcagncg aggcgagcag 120 

Lagccagggc actgctgcca acagccagtc cnnataccat catgtnaccc ggtgngctct 180 

riaanttngat ntccanagcc ctacccatcn tagttctgct ctcccaccgg ntaccagccc 240 

c,ac.\ qcccag gaatcctaca gccagtaccc tgtcccgacg tctctaccta ccagtacgat 300 
gagacctcca qctactacta rgacc 325 

<2jC> 392 
<211> 277 
<:212> DNA 

<213,^' Hornc sapier.s 

<220> 

<221> misc_ t eat ure 
<222> (I ) . . . (277) 
<223> II - A,T,C or G 

<400> 392 

atattgttta actcct.tcct ttatatcttt taacattttc atggngaaag gttcacatct 60 

agtctcactt nqgcnagngn ctcctacttg agtctcttcc ccggcctgnn ccaqtngnaa 120 

antaccanga accgncatgn cttaanaacn ncctggtttn tgggttnntc aatgactgca 180 

tgcagtgcac caccclgtcc actacgtgat gctgtaggat taaagtctca cagtgggcgg 240 

ctgaggatac agcgccgcgt cctgtgttgc tggggaa 277 

<210> 393 
<211> 566 
<212> DNA 
<213> Homo sapiens 



<400> 393 

a ctagtccag 

gtgatctaca 

ttgccgggaa 

gagaaggtct 

gaggggtcta 

gggtggtttt 

catttattaa 

ttctgcct ca 

catt ctctgc 

ttttgcctat 



t gtggtggaa 
ttctgaagt t 
cactgcagag 
agtttgtcca 
ggagat ct gt 
caaaagt aga 
t cat ccct gc 
atgtt tactg 
ctgagtttt a 
caaaaaaaaa 



ttcgcggccg 
gt ctgaaaat 
acaatgctgt 
tcagcattat 
cccttttaga 
aa tgtcctgt 
ct gtgtctat 
tgcctttgt t 
att t ttgtcc 
a a a aaa 



cgtcgacgga 
gtcttcatga 
gagtttccaa 
catgatatca 
gacaccttac 
attccgatga 
tattat attc 
tt tgctagtt 
aaagttattt 



caggt cagct 
ttaaattcag 
ccttagccca 
ggactggtta 
ttataatgaa 
tcatcctgta 
atatctctac 
tgtgttgttg 
taatct atac 



gtctggctca 60 

cctaaacgtt 120 

tctgcgggca 180 

cttggttaag 240 

gtatttggga 300 

aacattttat 360 

gctggaaact 420 

a.a3iaa3L3.a.3ia 480 

aattaaaagc 540 
566 



<210> 394 
<211> 384 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_f eacure 
<222> (1) ... (384 ) 
'^223:- n = A, C or 
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<400> 394 

gaacatacat gtcccggcac ctgagctgca 
tgcaaattng gaccgggcca aggctggact 
gcaggaggac cgggctttaa ggagttttaa 
tcccaagatt atcgggagaa agggggcagt 
gaacatccag tttcctgata aggacgatgg 
^g9g"tacgaa aagaacacag aagctgccag 
tgagcagatg gtttctgagg acgt 

<210> 395 
<211> 399 
<212> DMA 
<213> Homo sapiens 



gtctgacatc atcgccatca cgggcctcgc 60 

gctggagcgt gtgaaggagc tacaggccna 120 

gctgagtgtc actgtagacc ccaaatacca 180 

aattacccaa atccggttgg agcatgacgt 240 

gaaccagccc caggaccaaa ttaccafcac 300 

ggatgctata ctgagaattg tgggtgaact 360 

384 



<400> 395 

ggcaaaactg tgtgacctca ataagacctc 
tctgaccttg gactccaaga cctacatcaa 
tatcagaggt ttcatcattg cggaaattgt 
attcacgtct ttccagtacc ctgagttctc 
ccagctactt gtctgcaatt gtatcttcaa 
caagttctct ttggaaagcc tgggcatctc 
gcagcctggt gagaccatcc aatcccaaat 

<210> 396 
<211> 403 
<212> DNA 
<213> Homo sapiens 



gcagatccaa ggtcaagtat cagaagtgac 60 

cagcctggct atattagatg atgagccagt 120 

ggagtctaag gaaatcatgg cctctgaagt 180 

tatagagttg cctaacacag gcagaattgg 240 

gaataccctg gccatccctt tgactgacgt 300 

ctcactacag acctctgacc atgggacggt 360 
aaaatgcac 399 



<220> 

<221> misc_feature 
<222> (1) . . . (403) 
<223> n - A,T,C or G 



<400> 396 

tggagttntc agtgcaaaca agccataaag 
gacattttca acttctgctc cagctgctga 
agacaaggac aacctgttcc ttcataactc 
actaaaaaaa gtggatgaat aatctggata 
taggaaaatg gagggcctta tgatcagaat 
gtttagggga gggagtgagg gataaaagaa 
atcaaagcag gtgctatcac tcaatgttag 

<210> 397 
<211> 100 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> inisc_f eature 
<222> (1) . . . (100) 
<223> n = A, T, C or G 



cttcagtagc aaattactgt ctcacagaaa 60 

taaaacaaat catgtgttta gcttgactcc 120 

tctagagaaa aaaaggagtt gttagtagat 180 

tttttcctaa aaagattcct tgaaacacat 240 

gctagaatta gtccattgtg ctgaagcagg 300 

ggaaaaaaag aagagtgaga aaacctattt 360 

gccctgctct ttt 403 



<400> 397 

actagtncag tgtggtggaa ttcgcggccg cgtcgaccta naanccatct ctatagcaaa 60 
tccatccccg ctcctggttg gtnacagaat gactgacaaa 100 

<210> 398 
<211> 278 
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^'.^13> Homo sapiens 

:?20> 

<221> rrii sc_f eature 
':222> (1 ) , . . (276) 
<223> n = A, T, C or G 



<400> 398 

gcggccgcgt cgacagcagt tccgccagcg 

■ rcacctggac atctggaagt cagcggcctg 

tcactactgt gcctcgacca gtgaggagag 

ctccgggcag cccatccacc tgtggcagtt 

ctatggccgc ttcatrangt ggctcaacaa 

<210> 399 
<211> 298 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> inisc_f eature 
<222> (1) . . . (298) 
<223> n - A,T,C or G 



ctcgcccctg ggtggggatg tgctgcacgc 60 
gatgaaagag cggacttcac ctggggcgat 120 
ctggaccgac agcgaggtgg actcatcatg 180 
CGtcaaggag ttgctactca agccccacag 240 
ggagaagg 278 



<400> 399 

acggaggtgg aggaagcgnc cctgggatcg 
qgggtgccng catqgagcgc atgggcgcgg 
ccgagatcga gcgcatgggc ctggtcatgg 
ccggcattga gcgcatgggc ccgctgggcc 
tgggccagac catggagcgc attggctctg 

<210> 400 
<211> 548 
<212> DNA 
<213> Homo sapiens 



anaggatggg tcctgncatt gaccncctcn 60 
gcctgggcca cggcatggat cgcgtgggct 120 
accgcatggg ctccgtggag cgcatgggct 180 
tcgaccacat ggcctccanc attgancgca 240 
gcgtggagcn catgggtgcc ggcatggg 298 



<400> 400 

acatcaacta cttcctcatt ttaaggtatg 
gtacatgtac atgtatgaaa tttccttctc 
caaagaacca cacgcttaga agggtaagag 
tgagtctctt ttttccacgt ttaaggggcc 
tgcagagggc tagagaatta tttcatacag 
tataccctct caccatcccc ttgtctactc 
gttggcccca taattctggg cctttgttgt 
ctttccagtg atctcctacc atgggccccc 
tccccagccc ctcctgcccc agcccacccg 
agcaggt t 



gcagttccct tcatcccctt ttcctgcctt 60 
ttaccgaact ctctccacac atcacaaggt 120 
ggcaccctat gaaatgaaat ggtgatttct 180 
atggcaggac ttagagttgc gagttaagac 240 
gctttgaggc cacccatgtc acttatcccg 300 
tgatgccccc aagatgcaac tgggcagcta 360 
ttgttttaat tacttgggca tcccaggaag 420 
ctcctgggat caagcccctc ccaggccctg 480 
cttgccttgg tgctcagccc tcccattggg 540 

548 



<210> 4 01 
<211> 355 
<212> DNA 
<213> HoTTto sapiens 



<220> 

<221> inisc_f eature 
-;222> (1) . . . (355) 
•^-'223> n = A, T, C or G 
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<400> 401 

actgtttcca tgttatgttt ctacacattg 

tgatgtctcc aagtagtcca ccttcattta 

taagagtggt ggcctatttc agctgctttg 

tataaatgaa tgtgctgaag caaagtgccc 

tttgttttgg actctctgtg gtcccttcca 

CGcttttgca ttgccaagtg ccataaccat 

<210> 402 
<211> 407 
<212> DNA 

<213> Homo sapiens 



ctacctcagt gctcctggaa acttagcttt 60 
actctttgaa actgtatcat ctttgccaag 120 
acaaaatgac tggctcctga cttaacgttc 180 
atggtggcgg cgaagaagan aaagatgtgt 240 
atgctgnggg tttccaacca ggggaagggt 300 
gagcactact ctaccatggn tctgc 355 



<220> 

<221> inisc_f cature 
<222> (1) . . , (407) 
<223> n = A, T, C or G 



<400> 402 

atggggcaag ctgqataaag aaccaagacc cactggagta tgctgtcttc aagaaaccca 60 

tctcacatgc ggtggcatac ataggctcaa aataaaggaa tggagaaaaa tatttcaagc 120 

aaatggaaaa cagaaaaaag caggtgttgc actcctactt tctgacaaaa cagactatgc 180 

gaataaagat aaaaaagaga aggacattac aaaggtggtc ctgacctttg ataaatctca 240 

ttgcttgata ccaacctggg ctgttttaat tgcccaaacc aaaaggataa tttgctgagg 300 

ttgtggagct tctcccctgc agagagtccc tgatctccca aaatttggtt gagatgtaag 360 

gntgattttg ctgacaactc cttttctgaa gttttactca tttccaa 407 



<210> 403 
<211> 303 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> {1} . . . (303) 
<223> n = A, T,C or G 



<400> 403 

cagtatttat 

tcctaagcaa 

tagagaacaa 

gggattggat 

tcttaacaac 



agccnaactg aaaagctagt agcaggcaag 
gagccatggc atggtgaaaa tgcaaaagga 
gacctactca gtcatgaaca aaaaggcaga 
attgtaatta tagagcagga agatgacagt 
gaccgaaacc cattatttac ataaacctcc 



tctcaaatcc aggcaccaaa 60 
gagtctggcc aatctacaaa 120 
caccaacatg gatctcatgg 180 
gatcgtcatt tggcacaaca 24 0 
attcggtaac catgttgaaa 300 

303 



<210> 404 
<211> 225 
<212> DNA 

<213> Homo sapiens 



<400> 404 

aagtgtaact tttaaaaatt tagtggattt tgaaaattct tagaggaaag taaaggaaaa 60 

attgttaatg cactcattta cctttacatg gtgaaagttc tctcttgatc ctacaaacag 120 

acattttcca ctcgtgtttc catagttgtt aagtgtatca gatgtgttgg gcatgtgaat 180 

ctccaagtgc ctgtgtaata aataaagtat ctttatttca ttcat 225 



<210> 405 
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: 1 1 ■ 334 
212/- DNA 

2 r: : - H omo sapiens 



v2z0> 

~^^'::']:- misc_f eature 

<:222> fi) . . . (334) 

- 22 3 n = A, T, C or G 



<A00> 405 

;ctgttat 
Ltcaatacac 
t catccccat 
' tcccagtgc 
clggt gcggt 
cact ct ccac 



actgtgagt t 
ctccccccat 
cccatgccaa 
ctccaggaca 
tgtgcctcca 
t ct ctcanng 



ctactaggaa 
agtgaatcag 
aggaagaccc 
gagtgggt ta 
gcttctgctc 
t ggatcccac 



atcatcaaat ctgagggttg 
cttccagggg gtccagtccc 
tccctccttg gctcacagcc 
tgttttcagc tccatccttg 
agtgcttcat ggacagtgtc 
ccct 



tctggaggac 60 
tctccttact 120 
ttctctaggc 180 
ctgtgagtgt 240 
cagcccatgt 300 
334 



<210> 406 
<211> 216 
<212> DNA 
<213> Homo sapiens 

-:220> 

-^^22 1> mis cofeature 

''-2/2> (1) . . . (216) 

<223> n - A, T, C or G 



<400> 406 

tttcatacct aatgagggag ttganatnac atnnaaccag gaaatgcatg gatctcaang 60 

gaaacaaaca cccaataaac tcggagtggc agactgacaa ctgtgagaca tgcacttgct 120 

acnaaacaca aatttnatgt tgcacccttg tttctacacc tgtgggttat gacaaagaca 180 

actgccaaag aatnttcaag aaggaggact gccant 216 

<210> 4 07 
<211> 413 
<212> DNA 

<213> Homo sapiens 



':4 00> 4 07 

gctgacttgc tagtatcatc tgcattcatt 

gtaaatgcaa taggattaaa aaataaattt 

gtacaacatt gcacccagtg tcagattcta 

cccagaggtc tatgtcctaa tgtgttatgg 

ggaaaattgt catttgtcca tgtgacagtt 

tgccagacag gagaaagtct tcccatgtta 

t.gggagttcc agaaaaagtt aaaacagaca 

<2i0> 408 
<211> 183 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> (1) . . . (183) 
<22 3> n - A,T,C or G 



gaagcacaag aacttcatgc cttgactcat 60 
gatatcacat ggaaacagac aaaaaatatt 120 
cacctggcca ctcaggaagc aagagttaat 180 
caaatggatg tcatgcacgt accttcattt 240 
gatacttatt cacatttcat atgggcaacc 300 
aaagacattt attatcttgt tttcctgtca 360 
atgggccagg ttctgtagta aag 413 



^'■4 00> 4 08 
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ggagctngcc ctcaattcct ccatntctat 
tncttaacta gttaatcctt aaagggctan 
cattatcctt ccagtattcn ccttctnttt 
ntt 



gttancatat ttaatgtctt ttgnnattaa 60 
ntaatcctta actagtccct ccattgtgag 120 
tatttactcc ttcctggcta cccatgtact 180 

183 



<210> 409 

<211> 250 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> inisc_f eature 
<222> (1) . . . (250) 
<223> n = A,T,C or G 



<400> 409 

cccacgcatg ataagctctt tatttctgta agtcctgcta ggaaatcatc aaatctgacg 60 

gtggtttggg ggacctgaac aaacctcctg taattaatca gctttcagtt tctcccccta 120 

gtccctcctt caacaacata ggaggatcct ccccttcttt ctgctcacgg ccttatctag 180 

gcttcccagt gcccccagga cagcgtgggc tatgtttaca gcgcntcctt gctggggggg 24 0 
ggccntatgc 250 



<210> 410 
<211> 306 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . , (306) 
<223> n = A, T,C or G 



<400> 410 

ggctggtttg 

agtcttgcaa 

cccagggacc 

aaggtgttgt 

nactggttgg 

tcntgc 



caagaatgaa atgaatgatt ctacagctag gacttaacct tgaaatggaa 60 
tcccatttgc aggatccgtc tgtgcacatg cctctgtaga gagcagcatt 120 
ttggaaacag ttggcactgt aaggtgcttg ctccccaaga cacatcctaa 180 
aatggtgaaa accgcttcct tctttattgc cccttcttat ttatgtgaac 24 0 
ctttttttgn atctttttta aactggaaag ttcaattgng aaaatgaata 300 

306 



<210> 411 
<211> 261 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (261) 
<223> n = A, T,C or G 



<400> 411 

agagatattn cttaggtnaa agttcataga gttcccatga actatatgac tggccacaca 



60 



ggatcttttg tatttaagga ttctgagatt ttgcttgagc aggattagat aaggctgttc 120 

tttaaatgtc tgaaatggaa cagatttcaa aaaaaaaccc cacaatctag ggtgggaaca 180 

aggaaggaaa gatgtgaata ggctgatggg caaaaaacca atttacccat cagttccagc 24 0 

cttctctcaa ggngaggcaa a 261 



<210> 412 
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/Jl> 241 
:2i2> DNA 

-'213> Homo sapiens 



^220> 

^221> ir.i sc_ feature 
<222> (1) , . . (241) 
^223> n - A, T, C or G 



<400> 412 

gttcaatgtt acctgacatt tctacaacac cccactcacc gatgtattcg ttgcccagtg 60 

ggaacatacc agcctgaatt tggaaaaaat aattgtgttt cttgcccagg aaatactacg 120 

actgactttg atggctccac aaacataacc cagtgtaaaa acagaagatg tggaggggag 180 

ctgggagatt tcactgggta cattgaattc ccaaactacc cangcaatta cccagccaac 240 
^ 241 

<210> 413 
<2ia> 231 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (1) . . . (231) 
<223> n = A, T, C or G 

<400> 413 

aactcttaca atccaagtga ctcatctgtg tgcttgaatc ctttccactg tctcatctcc 60 

ctcatccaag tttctagtac cttctctttg ttgtgaagga taatcaaact gaacaacaaa 120 

aagtttactc tcctcatttg gaacctaaaa actctcttct tcctgggtct gagggctcca 180 

agaatccttg aatcanttct cagatcattg gggacaccan atcaggaacc t 231 

<210> 414 
<211> 234 
<212> DNA 

<213> Homo sapiens 
<400> 414 

actgtccatg aagcactgag cagaagctgg aggcacaacg caccagacac tcacagcaag 60 
gatggagctg aaaacataac ccactctgtc ctggaggcac tgggaagcct agagaaggct 120 
gtgagccaag gagggagggt cttcctttgg catgggatgg ggatgaagta aggagaggga 180 
ctggaccccc tggaagctga ttcactatgg ggggaggtgt attgaagtcc tcca 234 

<210> 415 
<211> 217 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc^feature 
<222> (1) . . . (217) 
<223> n = A, T, C or G 

-:4 00:. 415 

gcataggatt aagactgagt atcttttcta cattctttta actttctaag gggcacttct 60 

caaaacacag accaggtagc aaatctccac tgctctaagg ntctcaccac cactttctca 120 

cacf:tagcaa tagtagaatt cagtcctact tctgaggcca gaagaatggt tcagaaaaat 180 

antggattiat aaaaaataac aattaagaaa aatdatc 217 
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416 
213 
DNA 

Homo sapiens 
<220> 

<221> misc_feature 
<222> {1)...{213) 
<223> n - A,T,C or G 



<400> 416 

atgcatatnt aaagganact gcctcgcttt tagaagacat ctggnctgct ctctgcatga 60 
ggcacagcag taaagctctt tgattcccag aatcaagaac tctccccttc agactattac 120 
cgaatgcaag gtggttaatt gaaggccact aattgatgct caaatagaag gatattgact 180 
atattggaac agatggagtc tctactacaa aag 213 



<210> 417 
<211> 303 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> m±sc__feature 
<222> (1) . . . {303) 
<223> n - A,T,C or G 



<400> 417 

nagtcttcag 

gtgggaaagg 

agaagccata 

ttcatctagt 

tcantcaaag 

agt 



gcccatcagg gaagttcaca ctggagagaa 
ctttactctg agttcaaatc ttcaagccca 
caaatgcaat gagtgtggga agagcttcag 
ggtccacaca ggagagaaac cctataaatg 
ttcgtatctt caaatccatc ngaaggncca 



gtcatacata tgtactgtat 60 
tcagagagtc cacactggag 120 
gagggattcc cattatcaag 180 
tgagatatgt gggaagggct 240 
cagtatanan aaacctttta 300 

303 



<210> 418 

<211> 328 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> (1)...(328) 
<223> n = A,T,C or G 



<400> 418 

tttttggcgg tggtggggca gggacgggac angagtctca ctctgttgcc caggctggag 60 
tgcacaggca tgatctcggc tcactacaac ccctgcctcc catgtccaag cgattcttgt 120 
gcctcagcct tccctgtagc tagaattaca ggcacatgcc accacaccca gctagttttt 180 
gtatttttag tagagacagg gtttcaccat gttggccagg ctggtctcaa actcctnacc 240 
tcagnggtca ggctggtctc aaactcctga cctcaagtga tctgcccacc tcagcctccc 300 
aaagtgctan gattacaggc cgtgagcc 328 



<210> 419 
<211> 389 
<212> DNA 
<213> Homo sapiens 
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<2-"0> 

<22I> mis cofeature 
<222^ (1) ... (389) 
:-223^* n - A, T, C or G 



<4 00> 419 
cc rcct caag 
;^cc'cct gage 
cttgttt cct 
cgagcaaggc 
':cggtt ctcc 
aaaggtagg 
tggcagccac 



acggcctgtg 
catggactgg 
ctct gt ggcr 
caagctggct 
agccaccaac 
accaaagggc 
tcnggctgtg 



gt ccgcctcc 
agcct gaaag 
ccat tcatag 
caaagagcaa 
ctcactcgct 
atctgctttt 
tcgacgcgg 



cggcaaccaa 
gcagcgtaca 
cacagttgtt 
ccagtcaact 
cccgcaaatg 
ctgaagtcct 



gaagcct gca 
ccctgctcct 
gcactgaggc 
ctgccacggt 
gcacatcagt 
ct gctctatc 



gtgccatatq 60 
gatcttgctg 120 
ttgtgcaggc 180 
gtgccaggca 240 
tcttctaccc 300 
agccatcacg 360 
389 



<2i0> 420 
<211> 408 
<212> DNA 

<213> Homo sapiens 



<400> 420 

gttcctccta 

tggccagggc 

gaagtgtact 

gt cecal tga 

qccaactcac 

gatatagaaa 

acgttgaccg 



actcctgcca 
agcaagcctt 
agccaaggag 
caccttt ccc 
ccagctgggc 
at t cttgaat 
gactttgatg 



gaaacagctc 
agcctt ggct 
tt gaagtttg 
act gacccca 
atggagcagc 
gagt cctata 
aagt gctatg 



tcctcaacat 
tcttgtttct 
tgactttggt 
taaaggaatc 
att atgaact 
aacatgaaca 
acaaacctgg 



gagagctgca 
gctttttttc 
gtttcggcat 
ct catggcca 
tggagagtat 
ggtttatatt 
caagcccg 



Gccctcctcc 60 
tggctagacc 120 
ggagaccgaa 180 
caaggatttg 240 
ataagaaaga 300 
cgaagcacag 360 
408 



<210> 421 
<211> 352 
<212> DNA 

<213> Homo sapiens 
<220> 

<:221> misc__f eature 
<222> (1) . . . (352) 
<223> n = A, C or G 



<400> 421 

gctcaaaaat 

gaggagaatg 

ttcactgaca 

ctccttcttg 

ggtgcaacat 

cactccgagt 



ctttttactg 
aggcctggcc 
gaacaggtct 
aagattcttt 
gaaattt ctg 
ttattgggtg 



atnggcatgg 
tgggagccct 
tttttgggtc 
ggcagttgtc 
tttcgtagca 
tt tgtttcct 



ctacacaatc 
gtgcctacta 
cttcttctcc 
tttgtcataa 
agtgcatgtc 
ttgagat cca 



attgactatt 
naagcacatt 
accacnatat 
cccacaggtg 
tcacaagttg 
tgcattt cct 



acggaggcca 60 
agattatcca 120 
acttgcagtc 180 
tagaaacaag 240 
gcangtctgc 300 



gg 



35: 



<210> 422 
<211> 337 
<212> DNA 

<213> Homo sapiens 



<400> 422 

atgccaccat 

jgatgatcga 

gcgatagcaa 

^:jt gaaatggc 

atccqacacc 

get t c:.t t cca 



get ggcaatg 
cggcaaccgt 
ggtgccggcg 
agctgtcgaa 
ggt gcacctg 
ccggt acggc 



cagcgggcgg 
t gcccgaagt 
atcgcggcgg 
ttgatctacc 
gaagccttgc 
tggcctatga 



t cgaaggcct 
tgccgatgcc 
cgtcaatcct 
cgggttatgg 
agcggctggg 
aaattat 



gcatatccag 
agecgaagcg 
ggccaaggtc 
catcggcggg 
gccgacgccg 



cccaagctgg 60 
gtggtcaagg 120 
agccgtqatc 180 
cataagggct 240 
attcaccgac 300 
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<210> 423 
<211> 310 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc^feature 
<222> (1) . . . (310) 
<223> n = A,T,C or G 



<400> 423 

gctcaaaaat ctttttactg atatggcatg gctacacaat cattgactat tagaggccag 60 

aggagaatga ggcctggcct gggagccctg tgcctactan aagcncatta gattatccat 120 

tcactgacag aacaggtctt ttttgggtcc ttcttctcca ccacgatata cttgcagtcc 180 

tccttcttga agattctttg gcagttgtct ttgtcataac ccacaggtgt anaaacaagg 24 0 

gtgcaacatg aaatttctgt ttcgtagcaa gtgcatgtct cacagttgtc aagtctgccc 300 
tccgagttta 310 



<210> 424 
<211> 370 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (370) 
<223> n = A,T,C or G 



<400> 424 

gctcaaaaat 

ggagaatgag 

cactgacaga 

ccttcttgaa 

ggttgaatct 

cacgaaggtg 

tccgtcgacg 



ctttttactg 
gcctggcctg 
acaggtcttt 
gattctttgg 
cctggaactc 
gcaaagatca 



ataggcatgg 
ggagccctgt 
tttgggtcct 
cagttgtctt 
cctcattagg 
caacgctgcc 



ctacacaatc 
gcctactaga 
tcttctccac 
tgtcataacc 
tatgaaatag 
cagganaaca 



attgactatt 
agcacattag 
cacgatatac 
cacaggtgta 
catgatgcat 
ttcattgtga 



agaggccaga 60 
attatccatt 120 
ttqcagtcct 180 
gaaacatcct 240 
tgcataaagt 300 
taagcaggac 360 
370 



<210> 425 
<211> 216 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (216) 
<223> n = A,T,C or G 



<400> 425 

aattgctatn ntttattttg ccactcaaaa taattaccaa aaaaaaaaaa tnttaaatga 60 

taacaacnca acatcaaggn aaananaaca ggaatggntg actntgcata aatnggccga 120 

anattatcca ttatnttaag ggttgacttc aggntacagc acacagacaa acatgcccag 180 
gaggntntca ggaccgctcg atgtnttntg aggagg 216 



<210> 426 
<211> 596 
<212> DNA 
<213> Homo sapiens 
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4 00:- 42 6 
cr. t ccagtga 
r aqr.j q t. cag 

uctgtccttg 
gacatcacgg 
ttdag:jagtt 
aaacgcacac 
ggtggatggc 
at a.:acL cat 
qtcccgctgg 



ggat aaccct 
tgat ggaagg 
rtgct gagtt 
tattt tqatt 
caact ttt aa 
cat ct gcact 
ttggcttttg 
cttttcagct 
atactcgtgg 
tcccat ccca 



gt tgccccgg 
gtgttctgat 
ggcagt agga 
aacct aatgg 
tgaaatgatt 
gataacttct 
gttttgagat 
ttaacccaat 
gcttagaggc 
ggaccttcca 



gccgaggt tc 
catt ccgacr 
cctaatttgt 
cctt cccagc 
tgaagggcca 
tggcagctga 
acaactctta 
ttgcactgcc 
cacagcagat 
tcggcgagt a 



t ccat t aggc 
gccccaaggg 
"caattaagag 
acgactcgga 
ttaagaggca 
gctggtcgga 
atcttttagt 
ttggaagtgt 
gtcattggtc 
cctgggagcc 



t ct gat tgat 
tcgctggcca 
tagatggtga 
ttcagctgga 
cttcccgtta 
gctgtggccc 
catgct tgag 
agccaggaga 
tactgcctga 
cgtgct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

596 



<210> 427 

<211> 107 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_f eature 

<222> (1) . . . (107 ) 

<223> n - A,T,C or G 

•-400> 427 

gaagaattca agttaggttt attcaaaggg cttacngaga atcctanacc caggncccag 60 
cccggqagca gccttanaga gctcctgttt gactgcccgg ctcagng 107 

<210> 428 

<211> 38 

<212> DNA 

<213> Homo sapiens 

<22 0> 

<221> inisc_f eature 
<222> (1) ... (38) 
<223> n = A, T, C or G 



<400> 428 

gaacttccna anaangactt tattcactat tttacatt 

<210> 429 
<211> 544 
<212> DNA 
<213> Homo sapiens 



38 



<400> 429 

ctttgctgga 

attgaagagc 

atatccacga 

tttggatggt 

gcctt ccact 

agatactaag 

tgatgt gcag 

qagt ttagtt 

acctcaacaa 

ttat 



cggaataaaa 
ggct gcagcc 
actcttgaag 
ggct catcac 
t cagttacac 
cccacatttg 
ttaaaaaatc 
caaagcagta 
gttagagaga 



gtggacgcaa 
ctgcggtt ca 
gactttctga 
ct gtagaacc 
ctcactcacc 
agatgcagca 
tgccctttta 
ttcagcgatt 
tatgcatatc 



gcatgacctc 
gatt aaaat c 
tttatccaca 
tgacttggcc 
atcctctcct 
gccatctccc 
tgatgtcctt 
tcaagagaag 
cagggatttt 



ctgatgaggg 
cgagaat t gt 
at caaatcat 
gtggctggaa 
gttggttctg 
ccaatt cctc 
gatgttctca 
ttttttattt 
ttgccaggt g 



cgctgcattt 
at agacgccg 
cggttttcag 
tccactcgtt 
tgctgcttca 
ctgtccatcc 
tcaagcccac 
ttgctttgac 
gtaggagaga 



60 

120 

180 

24 0 

300 

360 

420 

480 

540 

544 
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<211> 507 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> inisc f eature 
<222> (1) . . . (507) 
<223> n - A, T, C or G 



<400> 430 

cttatcncaa tggggctccc aaacttggct gtgcagtgga aactccgggg gaattttgaa 60 

gaacactgac acccatcttc caccccgaca ctctgattta attgggctgc agtgagaaca 120 

gagcatcaat ttaaaaagct gcccagaatg ttntcctggg cagcgttgtg atctttgccn 180 

ccttcgtgac tttatgcaat gcatcatgct atttcatacc taatgaggga gttccaggag 240 

attcaaccag gatgtttcta cncctgtggg ttatgacaaa gacaactgcc aaagaatntt 300 

caagaaggag gactgcaagt atatcgtggt ggagaagaag gacccaaaaa agacctgttc 360 

tgtcagtgaa tggataatct aatgtgcttc tagtaggcac agggctccca ggccaggcct 420 

cattctcctc tggcctctaa tagtcaatga ttgtgtagcc atgcctatca gtaaaaagat 480 
ttttgagcaa aaaaaaaaaa aaaaaaa 507 



<210> 431 
<211> 392 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> inisc_feature 
<222> (1) . . . (392) 
<223> n = A,T,C or G 



<400> 431 

gaaaattcag 

aaacaagaaa 

tatcatggct 

aagagatggg 

catcattcca 

acaaaagtga 

gcaatgagtc 



aatggataaa 
gcacttatca 
aaatgtgaga 
aaacaaaatc 
gcattctgag 
tgttgttagg 
tggcttttac 



aacaaatgaa 
ggaggactta 
ttagcacagc 
ccaggagttt 
attagggnga 
taaaatgtac 
tctgctgttt 



gtacaaaata 
caaatggaag 
tgtattattt 
tgtgtgtgga 
ttggggatca 
aacttctgga 
ct 



tttcagattt 
tacactctan 
gtacattgca 
gtcctgggtt 
ttctggagtt 
tctatgcaga 



<210> 432 
<211> 387 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (387) 
<223> n = A, T,C or G 



acatagcgat 60 
aaccatcatc 120 
aacacctaga 180 
ttccaacaga 240 
ggaatgttca 300 
cattgaaggt 360 
392 



<400> 432 

ggtatccnta cataatcaaa tatagctgta gtacatgttt tcattggngt agattaccac 60 
aaatgcaagg caacatgtgt agatctcttg tcttattctt ttgtctataa tactgtattg 120 
ngtagtccaa gctctcggna gtccagccac tgngaaacat gctcccttta gattaacctc 180 
gtggacnctn ttgttgnatt gtctgaactg tagngccctg tattttgctt ctgtctgnga 240 
attctgttgc ttctggggca tttccttgng atgcagagga ccaccacaca gatgacagca 300 
atctgaattg ntccaatcac agctgcgatt aagacatact gaaatcgtac aggaccggga 360 
acaacgtata gaacactgga gtccttt 387 
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'.y..,i.': 433 

•■■-3 '^2.- 381 

<313-^ DNA 

3 .. V ■ Homo sapiens 



<32i> naisc^feature 

333; (1) . . . (281) 
<333> n - A,T,C or G 

■^'lOO.- 4 33 

i.^'c^actagc anagaanact gcttcagggn 

ctgatraaaq aacactaaga gagggacaag 

caqgcnctat ttggqttqgc tggaggagct 

atcgccgtgg c^at^tcctcn ttgntattac 

tnnaaaaccg nia'.acaata atgatagaat 



gtgtaaaatg aaaggcttcc acgcagttat 60 

gctagaagcc gcaggatgtc tacactatag 120 

gtggaaaaca tggagagatt ggcgctggag 180 

accagngagg ntctctgtnt gcccactggt 240 

aggacacaca t 281 



<210> 434 
<211> 484 
<212> DNA 

<213> Homo sapiens 



<4 00> 4 34 

ttttaaaata a-::ca 1 1 t.aqt 
aatttaartc tttcaacttq 
tgttgcaaaa aaaaaaaagt 
tttttcccca tLggaactag 
agctagtcta tcagcatctg 
cagcctgttt ctatcctgtt 
tgctccaatc tgtcacataa 
tttatttttc tatgtgtttt 
ttta 



gctcagtccc tactgagtac 
caatttgcaa ggattacaca 
gtctttgttt aaaattactt 
tcattaaccc atctctgaac 
acaggtgaat tggatggttc 
taataaatta gtttgggttc 
aagtctgtga cttgaagttt 
ttgcaacata tgagtgtttt 



tctttctctc ccctcctctg 60 
tttcactgtg atgtatattg 120 
ggtttgtgaa tccatcttgc 180 
tggtagaaaa acatctgaag 240 
tcagaaccat ttcacccaga 300 
tctacatgca taacaaaccc 360 
agtcagcacc cccaccaaac 420 
gaaaataaag tacccatgtc 480 

484 



-:210> 435 
<211> 424 
<212> DNA 

<213> Homo sapiens 



<400> 435 

gcgccgctca gagcaggtca 
gggtagcttt caatatcgca 
cgatcgggca agtaaacccc 
atqggcctgt ggggaggggg 
cttggagaga ggaaaaaggc 
ggtagagacc tttgggggtc 
ijctatcagaa acttaaactt 
aaac 



ctttctgcct tccacgtcct 
ggttcttact cctctgcctc 
ctccctcgcc gacttcggaa 
caagatagat gagggggagc 
cacaagaggg gctgccaccg 
tggaacctct ggactcccca 
gaggattttc tctgtttttc 



ccttcaagga agccccatgt 60 
tataagctca aacccaccaa 120 
ctggcgagag ttcagcgcag 180 
ggcatggtgc ggggtgaccc 240 
ccactaacgg agatggccct 300 
tgctctaact cccacactct 360 
actcgcaata aattcagagc 420 

424 



<210> 436 
<211> 667 
<212> DNA 
<213> HorPvO sapiens 



'220> 

<221> misc^f eature 
<222> ( 1 ) . . . ( 667 ) 
-.■223> n =- A, T, C or G 
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<400> 436 

accttgggaa nactctcaca atataaaggg tcgtagactt tactccaaat tccaaaaagg 60 

tcctggccat gtaatcctga aagttttccc aaggtagcta taaaatcctt ataagggtgc 120 

agcctcttct ggaattcctc tgatttcaaa gtctcactct caagttcttg aaaacgaggg 180 

cagttcctga aaggcaggta tagcaactga tcttcagaaa gaggaactgt gtgcaccggg 240 

atgggctgcc agagtaggat aggattccag atgctgacac cttctggggg aaacagggct 300 

gccaggtttg tcatagcact catcaaagtc cggtcaacgt ctgtgcttcg aatataaacc 360 

tgttcatgtt tataggactc attcaagaat tttctatatc tctttcttat atactctcca 420 

agttcataat gctgctccat gcccagctgg gtgagttggc caaatccttg tggccatgag 480 

gattccttta tggggtcagt gggaaaggtg tcaatgggac ttcggtctcc atgccgaaac 540 

accaaagtca caaacttcaa ctccttggct agtacacttc ggtctagcca gaaaaaaagc 600 

agaaacaaga agccaaggct aaggcttgct gccctgccag gaggaggggt gcagctctca 660 
tgttgag 667 

<210> 437 
<211> 693 
<212> DNA 
<213> Homo sapiens 

<400> 437 

ctacgtctca accctcattt ttaggtaagg aatcttaagt ccaaagatat taagtgactc 60 

acacagccag gtaaggaaag ctggattggc acactaggac tctaccatac cgggttttgt 120 

taaagctcag gttaggaggc tgataagctt ggaaggaact tcagacagct ttttcagatc 180 

ataaaagata attcttagcc catgttcttc tccagagcag acctgaaatg acagcacagc 240 

aggtactcct ctattttcac ccctcttgct tctactctct ggcagtcaga cctgtgggag 300 

gccatgggag aaagcagctc tctggatgtt tgtacagatc atggactatt ctctgtggac 360 

catttctcca ggttacccta ggtgtcacta ttggggggac agccagcatc tttagctttc 420 

atttgagttt ctgtctgtct tcagtagagg aaacttttgc tcttcacact tcacatctga 480 

acacctaact gctgttgctc ctgaggtggt gaaagacaga tatagagctt acagtattta 54 0 

tcctatttct aggcactgag ggctgtgggg taccttgtgg tgccaaaaca gatcctgttt 600 

taaggacatg ttgcttcaga gatgtctgta actatctggg ggctctgttg gctctttacc 660 

ctgcatcatg tgctctcttg gctgaaaatg acc 693 

<210> 438 
<211> 360 
<212> DNA 
<213> Homo sapiens 

<400> 438 

ctgcttatca caatgaatgt tctcctgggc agcgttgtga tctttgccac cttcgtgact 60 
ttatgcaatg catcatgcta tttcatacct aatgagggag ttccaggaga ttcaaccagg 120 
atgtttctac acctgtgggt tatgacaaag acaactgcca aagaatcttc aagaaggagg 180 
actgcaagta tatctggtgg agaagaagga cccaaaaaag acctgttctg tcagtgaatg 24 0 
gataatctaa tgtgcttcta gtaggcacag ggctcccagg ccaggcctca ttctcctctg 300 
gcctctaata gtcaataatt gtgtagccat gcctatcagt aaaaagattt ttgagcaaac 360 

<210> 439 
<211> 431 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (431) 
<223> n = A,T,C or G 

<400> 439 

gttcctnnta actcctgcca gaaacagctc tcctcaacat gagagctgca cccctcctcc 60 
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t ggccagggc aqcaagcctt agccttggct 

gaagrgract aqccaaggag ttgaagtttg 

gtrccartga caccttrccc actqacccca 

...:c. -dcrcac ccagctgggc argqagcagc 

gaLatagaaa attcttgaat gagtcctata 

acattgaccg gactttgatg agtgctarga 

aatttagtag t 



tcttgtttct gcttttrttc tggctagacc 120 
tgactttqgt gtttcggcat ggagaccgaa 18G 
taaaggaatc ctcatgacca caaqgatttg 240 
attatgaact tggagagtat ataagaaaga 300 
aacatgaaca ggtttatatt cgaagcacag 360 
caaacctggc agcccgtcga cgcggccgcg 420 

431 



<210> 440 

^2i:> 523 

':2:^i2> DNA 

<^'13> Homo 



sapiens 



-:400> 440 

agagataaag cttaggtcaa agttcataga 

ggatcttttg tatttaagga ttctgagatt 

tttaaatgtc tgaaatggaa cagatttcaa 

aggaaggaaa gatgtgaata ggctgatggg 

cttctctcaa ggagaggcaa agaaaggaga 

actggaaaac tgctactatc tgtttttata 

taaaaattaa aacctctttg rgtcccttgg 

acaaaaatca aactttacag aaagatttga 

Vatatatatc atagcaaata agtcatctga 

<210> 441 

<211> 430 

<212> DNA 

<213> Homo sapiens 



gttcccatga actatatgac tggccacaca 60 
ttgcttgagc aggattagat aaggctgttc 120 
aaaaaaaccc cacaatctag ggtgggaaca 180 
caaaaaacca atttacccat cagttccagc 240 
tacagtggag acatctggaa agtttrctcc 300 
tttctgttaa aatatatgag gctacagaac 360 
tcctggaaca tttatgttcc ttttaaagaa 420 
tgtatgtaat acatatagca gctcttgaag 480 
tgagaacaaq eta 523 



<400> 441 

gttcctccta actcctgcca gaaacagctc 
hggccagggc agcaagcctt agccttggct 
gaagtgtact agccaaggag ttgaagtttg 
gtcccattga cacctttccc actgacccca 
gccaactcac ccagctgggc atggagcagc 
gatatagaaa attcttgaat gagtcctata 
acgttgaccg gactttgatg agtgctatga 
aatttagtag 

<210> 442 
<211> 362 
<212> DNA 

--213> Hojno sapiens 



tcctcaacat gagagctgca cccctcctcc 60 
tcttgtttct gctttttttc tggctagacc 120 
tgactttggt gtttcggcat ggagaccgaa 180 
taaaggaatc ctcatggcca caaggatttg 240 
attatgaact tggagagtat ataagaaaga 300 
aacatgaaca ggtttatatt cgaagcacag 360 
caaacctggc agcccgtcga cgcggccgcg 420 

430 



<4 00> 44 2 

ctaaggaatt agtagtgttc ccatcacttg 
tttcctggaa tgacaattat attttaactt 
cttcacttct gatacttgta aattaatctt 
atgtttagaa atggtcattt tacggaaaaa 
aatgaattaa tgttttactt aatttatatt 
tgattatttt ttgttttcat ttaccagaat 
tc 

-210^ 443 
<211> 624 
<212> DNA 

■'21 3^- Homo sapiens 



tttggagtgt gctattctaa aagattttga 60 

tggtggggga aagagttata ggaccacagt 120 

ttattgcact tgttttgacc attaagctat 180 

ttagaaaaat tctgataata gtgcagaata 240 

gaactgtcaa tgacaaataa aaattctttt 300 

aaaaactaag aattaaaagt ttgattacag 360 

362 
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<220> 

<221> misc feature 
<222> (1) .7. (624) 
<223> n - A,T,C or G 



<400> 443 

tttttttttt gcaacacaat atacatcaca 
ttgaaagaat taaattcaga ggaggggaga 
aatgcttatt ttaaaagaaa tgtaaagagc 
tgctggctag tactccggtc ggtgtcagca 
cccaaaccac agaaaatggg gtgaaattgg 
tataaaatat tgtgaataat atcacctact 
taacgcctac aaaacactta aacatagata 
atggtaaaca tccttattat taaagtcaac 
agtacagaga qagqgcactt aaaccaacta 
ngatgcttgt qctgggtcca aatcttggtc 
ttgtccctat ctqctaaaca gate 



gtgaaatgtg taatccttgc aaattgcaag 60 
gaaagagtac tcagtaggga ctgagcacta 120 
agaaagcaat tcaggctacc ctgccttttg 180 
gcacgtggca ttgaacattg caatgtggag 240 
ccaactttct attaacttgg cttcctgttt 300 
tcaaagggca gttatgaggc ttaaatgaac 360 
acataggtgc aagtactatg tatctggtac 420 
gctaaaatga atgtgtgtgc atatgctaat 480 
agggcctgga gggaaggttt cctggaaaga 540 
tactatgacc ttggccaaat tatttaaact 600 

624 



<210> 444 
<211> 425 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc^f eature 
<222> (1) , . . (425) 
<223> n - A,T, C or G 



<400> 444 

gcacatcatt nntcttgcat tctttgagaa taagaagatc agtaaatagt tcagaagtgg 60 

gaagctttgt ccaggcctgt gtgtgaaccc aatgttttgc ttagaaatag aacaagtaag 120 

ttcattgcta tagcataaca caaaatttgc ataagtggtg gtcagcaaat ccttgaatgc 180 

tgcttaatqt qaqaggttgg taaaatcctt tgtqcaacac tctaactccc tgaatgtttt 240 

gctgtgctgg gacctgtgca tgccagacaa ggccaagctg gctgaaagag caaccagcca 300 

cctctgcaat ctgccacctc ctgctggcag gatttgtttt tgcatcctgt gaagagccaa 360 

ggaggcacca gggcataagt gagtagactt atggtcgacg cggccgcgaa tttagtagta 420 

gtaga 425 



<210> 445 
<211> 414 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> niisc_f eature 
<222> (1) . . . (414) 
<223> n = A, T, C or G 



<400> 445 

catgtttatg nttttggatt actttgggca cctagtgttt ctaaatcgtc tatcattctt 60 

ttctgttttt caaaagcaga gatggccaga gtctcaacaa actgtatctt caagtctttg 120 

tgaaattctt tgcatgtggc agattattgg atgtagtttc ctttaactag catataaatc 180 

tggtgtgttt cagataaatg aacagcaaaa tgtggtggaa ttaccatttg gaacattgtg 240 

aatgaaaaat tgtgtctcta gattatgtaa caaataacta tttcctaacc attgatcttt 300 

ggatttttat aatcctactc acaaatgact aggcttctcc tcttgtattt tgaagcagtg 360 

tgggtgctgg attgataaaa aaaaaaaaag tcgacgcggc cgcgaattta gtag 414 

<210> 446 
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Homo sapiens 



-y^ 21^^ iiii. s c_f eature 

<222> (1) .7. (631) 

■ r;3> n - A,T,C or G 



</;.;0> 44 6 

ciCciacittaga anaaagtgcc agagaacacc acataccttg tccggaacat tacaatggct 60 

Lctqcatgca tgggaagtgt gagcattcta tcaatatgca ggagccatct tgcaggtgtg 120 

atgctggtt-a tactggacaa cactgtgaaa aaaaggacta cagtgttcta tacgttgttc 180 

ccqgtcctgt acgatttcag tatgtcttaa tcgcagcrgt gattggaaca attcagattg 240 

ctqtcatctg tgtggtggtc ctctgcatca caagggccaa actttaggta atagcattgg 300 

actgagattt gtaaactttc caaccttcca ggaaatgccc cagaagcaac agaattcaca 360 

gacagaaqca aaatacaggg cactacagtt cagacaatac aacaagagcg tccacgaggt 420 

taatctaaag ggagcatgtt tcacagtggc tggactaccg agagcttgga ctacacaata 480 

caqtattata gacaaaagaa taagacaaga gatctacaca tgttgccttg catttgtggt 540 

aatctacacc aatgaaaaca tgtactacag ctatatttga ttatgtatgg atatatttga 600 

aatagtatac attgtcttga tgttttttct g 631 

<210> 447 
;211> 585 
<212> DNA 

<213> Homo sapiens 



<22C> 

<221> misc__f eature 
<222> (1) . . . (585) 
<223> n = A,T,C or G 



<4 00> 4 47 

v.:OT. tgggaaa 

"-oi.ggccat g 

gcctcttctg 

agttcctgaa 

t.gggct gcca 

ccaggtttgt 

gttcatgttt 

gttcataatg 

attcctttat 

ccaaagtcac 



antnt cacaa 
taat cctgaa 
gaatt octet 
aggcaggtat 
gagtaggata 
catagcactc 
ataggactca 
ct gctccatg 
ggggtcagtg 
aaacttcaac 



tataaagggt 
agttttccca 
gatttcaaag 
agcaactgat 
ggattccaga 
atcaaagtcc 
ttcaagaat t 
cccagctggg 
ggaaaggtgt 
t ccttggct a 



cgtagacttt 
aggtagctat 
t ct cactctc 
cttcagaaag 
tgctgacacc 
ggtcaacgtc 
ttctatatct 
tgagttggcc 
caatgggact 
gtacacttcg 



actccaaatt 
aaaatcctta 
aagttcttga 
aggaactgtg 
ttctggggga 
tgtgcttcga 
ctttcttata 
aaatccttgt 
tcggtctcca 
gtcta 



ccaaaaaggt 
taagggtgca 
aaacgagggc 
tgcaccggga 
aacagggctg 
atataaacct 
tactctccaa 
ggccatgagg 
tgccgaaaca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

585 



<2I0> 448 
<211> 93 
<212> DKA 

<213> Homo saoiens 



<220> 

<221> misc_f eature 

<222> (1) ... (93) 

^223> n - A, T, C or G 



<400> 448 

Loctcgtggg tcattctgan nnccgaactg accntgccag ccctgccgan gggccnccat 60 
7cicrr-::ctag tgccctiggag agganggggc tag 93 
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<210> 449 
<211> 706 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (1) .7. (706) 
<223> n = A, T,C or G 



<400> 449 

ccaagttcat 

ttctganeac 

cctggagagg 

cggggacagc 

gttgggaagg 

gtgctgcaag 

cgacggccag 

cgtacgtaag 

cgacgtggga 

cactgagcag 

aacaggttga 

gcatggatga 



gctntgtgct 
cgaactgacc 
aggtgtctag 
atcctgcaga 
gcgat cggtg 
gcgattaagt 
tgaattgaat 
cttggatcct 
tccncactga 
aagctggagg 
acctgggagg 
cagagtgaaa 



ggacgctgga 
atgccagccc 
tcagagagta 
tggtcgggcg 
cgggcctctt 
tgggtaacgc 
ttaggtgacn 
ctagagcggc 
gagagtggag 
cacaacgcnc 
tggaggttgc 
ctccatctta 



cagggggcaa 
tgccgatggt 
gtcctggaag 
cgtcccattc 
cgctattacg 
cagggttttc 
ctatagaaga 
cgcctactac 
agtgacatgt 
cagacactca 
aatgagctga 
aaaaaaaaaa 



aagcnnttgc 
cctccatggc 
gtggcctctg 
gccattcagg 
ccagctggcg 
ccagtcncga 
gctatgacgt 
tactaaattc 
gctggacnct 
cagctactca 
gatcaggccn 
aaaaaa 



tcgtgggtca 
tccctagtgc 
ngaggagcca 
ctgcgcaact 
aaagggggat 
cgttgtaaaa 
cgcatgcacg 
gcggccgcgt 
gtccatgaag 
ggaggctgag 
ctgcncccca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

706 



<210> 450 
<211> 493 
<212> DNA 
<213> Homo sapiens 



<400> 450 

gagacggagt 

acagttttaa 

aaatgaggct 

agcctaagta 

caagtcaggt 

agagacactg 

tcaagtcaac 

tacacatcag 

gcgaatttag 



gtcactctgt 
aaggtaaaac 
gagaacttta 
taagaacaac 
agtgaaatgg 
tcagagagtt 
acatctgtga 
aatcacctgg 
tag 



tgcccaggct 
aacataaaaa 
caaagggatc 
ctttggggag 
gtggaattaa 
aaaaagtgag 
actcacagac 
agagctttac 



ggagtgcagc 
gaaatatcct 
ttacagacat 
aaaccatcat 
actcaaatta 
tt ctatccat 
caagttctta 
aaactcccat 



aagacactgt 
atagtggaaa 
gtcgccaata 
ttgacagtga 
atcctgccag 
gaggtgattc 
aaccactgtt 
tgccgagggt 



ctaagaaaaa 
taagagagtc 
tcactgcatq 
ggtacaattc 
ctgaaacgca 
cacagtcttc 
caaactctgc 
cgacgcggcc 



60 

120 

180 

240 

300 

360 

420 

480 

493 



<210> 451 
<211> 501 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc^feature 
<222> (1) . . . (501) 
<223> n = A,T,C or G 



<400> 451 

gggcgcgtcc cattcgccat tcaggctgcg caactgttgg gaagggcgat cggtgcgggc 60 
ctcttcgcta ttacgccagc tggcgaaagg gggatgtgct gcaaggcgat taagttgggt 120 
aacgccaggg ttttcccagt cncgacgttg taaaacgacg gccagtgaat tgaatttagg 180 
tgacnctata gaagagctat gacgtcgcat gcacgcgtac gtaagcttgg atcctctaga 24 0 
gcggccgcct actactacta aattcgcggc cgcgtcgacg tgggatccnc actgagagag 300 
tggagagtga catgtgctgg acnctgtcca tgaagcactg agcagaagct ggaggcacaa 360 
cgcnccagac actcacagct actcaggagg ctgagaacag gttgaacctg ggaggtggag 420 



BNSDOCtD -:WO 0151633A? r > 



w o (M /51633 



154 



PCT/l S01/(I1574 



gt "L acaarga gctgagatca ggccnctgcn ccccagcatg gatgacagag 
'ccl i_£iaaaa3 aaaaaaaaaa a 



igaaact cca 



480 
501 



■.Z10> 452 

'^211> 51 

<?.12> DNA 

<213> Homo sapiens 



<220> 

-.22 1> ruisc_ 

^222> (1) 



leature 
{51} 



^;223: 



n - A, T, C or G 



<400> 452 

agacggtttc accnttacaa cnccttttag gatgggnntt ggggagcaag c 

<210> 453 
<211> 317 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> inisc__f eature 
<222> (1) . . . (317) 
<223> n = A,T,C or G 



51 



<400> 453 

tacat cttgc 

acatctgaag 

ttcacGcana 

taacaaaccc 

cccaccaaac 

tacccatgtc 



tttttcccca 
agctagtcta 
cagcctgttt 
tgctccaatc 
tttatttttc 
tttatta 



ttggaactag 
t cagcatctg 
ctatcctgtt 
tgt cacataa 
tatgtgtttt 



t cattaaccc 
gcaagtgaat 
taataaatta 
aagtctgtga 
ttgcaacata 



at ctctgaac 
tggatggttc 
gtttgggttc 
cttgaagttt 
tgagtgtttt 



tggtagaaaa 60 
tcagaaccat 120 
tctacatgca 180 
antcagcacc 240 
gaaaataagg 300 
317 



<210> 454 

<211> 231 

<212> DNA 

<213> Homo sapiens 

<400> 454 

ttcgaggtac aatcaactct cagagtgtag tttccttcta tagatgagtc agcattaata 60 

taagccacgc cacgctcttg aaggagtctt gaattctcct ctgctcactc agtagaacca 120 

agaagaccaa attcttctgc atcccagctt gcaaacaaaa ttgttcttct aggtctccac 180 

ccttcctttt tcagtgttcc aaagctcctc acaatttcat gaacaacagc t 231 

<210> 455 

<211> 231 

<212> DNA 

<213> Horr.o sapiens 

<400> 455 

taccaaagag ggcataataa tcagtctcac agtagggttc accatcctcc aagtgaaaaa 60 
cattgttccg aatgggcttt ccacaggcta cacacacaaa acaggaaaca tgccaagttt 120 
gtttcaacgc attqatgact tctccaagga tcttcctttg gcatcgacca cattcagggg 180 
caaagaattt ctcatagcac agctcacaat acagggctcc tttctcctct a 231 

^;210> 456 
< 211 > 2 31 
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<212> DNA 

<213> Homo sapiens 

<400> 456 

ttggcaggta cccttacaaa gaagacacca taccttatgc gttattaggt ggaataatca 60 

ttccattcag tattatcgtt attattcttg gagaaaccct gtctgtttac tgtaaccttt 120 

tgcactcaaa ttcctttatc aggaataact acatagccac tatttacaaa gccattggaa 180 

cctttttatt tggtgcagct gctagtcagt ccctgactga cattgccaag t 231 

<210> 457 
<211> 231 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> inisc_f eature 
<222> (1) . . . (231) 
<223> n - A,T,C or G 

<400> 457 

cgaggtaccc aggggtctga aaatctctnn tttantagtc gatagcaaaa ttgttcatca 60 
gcattcctta atatgatctt gctataatta gatttttctc cattagagtt catacagttt 120 
tatttgattt tattagcaat ctctttcaga agacccttga gatcattaag ctttgtatcc 180 
agttgtctaa atcgatgcct catttcctct gaggtgtcgc tggcttttgt g 231 

<210> 458 
<211> 231 
<212> DNA 
<213> Homo sapiens 

<400> 458 

aggtctggtt ccccccactt ccactcccct ctactctctc taggactggg ctgggccaag 60 

agaagaggqq tqgttaggga agccgttgag acctgaagcc ccBCcctctPi ncttrcttca 120 

acaccctaac cttgggtaac agcatttgga attatcattt gggatgagta gaatttccaa 180 

ggtcctgggt taggcatttt ggggggccag accccaggag aagaagattc t 231 

<210> 459 
<211> 231 
<212> DNA 
<213> Homo sapiens 

<400> 459 

ggtaccgagg ctcgctgaca cagagaaacc ccaacgcgag gaaaggaatg gccagccaca 60 

ccttcgcgaa acctgtggtg gcccaccagt cctaacggga caggacagag agacagagca 120 

gccctgcact gttttccctc caccacagcc atcctgtccc tcattggctc tgtgctttcc 18 0 

actatacaca gtcaccgtcc caatgagaaa caagaaggag caccctccac a 231 

<210> 460 
<211> 231 
<212> DNA 
<213> Homo sapiens 

<400> 460 

gcaggtataa catgctgcaa caacagatgt gactaggaac ggccggtgac atggggaggg 60 

cctatcaccc tattcttggg ggctgcttct tcacagtgat catgaagcct agcagcaaat 120 

cccacctccc cacacgcaca cggccagcct ggagcccaca gaagggtcct cctgcagcca 180 

gtggagcttg gtccagcctc cagtccaccc ctaccaggct taaggataga a 231 
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-:r^0> 4 61 
^211> 231 
■:2:2> DNA 

Homo sapiens 



''.100> 4 61 

cqagqtttga gaagctctaa tgtgcagggg agccgagaag caggcggcct agggagggtc 60 

gv-^vj*- qtgct c cagaagagtg tgtgcatgcc agaggggaaa caggcgccrg tgtgtcctgg 120 

qtgqggttca gtgaggagtg ggaaattggt tcagcagaac caagccgrtg ggtgaataag 180 

aggqggattc catggcactg atagagccct atagtttcag agctgggaat t 231 



-2 [0> 4 62 
<211> 231 
<212> DNA 
<213> Homo sapiens 



<4 00> 4 62 

aggtaccctc attgtagcca tgggaaaatt 
gggtcatgca agtataaaaa ttaaaaaaaa 
gaagaactgt tagagagacc aacagggtag 
tctagaggag gtatttaatt tcttctcact 

<:^10> 463 
•.2:il> 231 
<2I2> DNA 

<213> Homo sapiens 



gatgttcagt ggggatcagt gaattaaatg 60 

aagacttcat gcccaatctc atatgatgtg 120 

tgggttagag atttccagag tcttacattt 180 

catccagtgt tgtatttagg a 231 



<400> 463 

tactccagcc tggtgacaga gcgagaccct 
actgagtaga caggtgtcct cttggcatgg 
catttgacag gtgtcttttc ctctggacct 
>^9gggsggtg gatcttccag tcgaagcggt 

<210> 464 

<211> 231 

<212> DNA 

<213> Homo sapiens 



atcaccgccc cccaccccac caaaaaaaaa 60 
taagtcttaa gtcccctccc agatctgtga 120 
cggtgtcccc atctgagtga gaaaaggcag 180 
atagaagccc gtgtgaaaag c 231 



<400> 464 

gtactctaag attttatcta 
aaggacatca catatgaaga 
cctgcttcag tgactgtgtg 
ggtgccagcg caccagctag 



agttgccttt tctgggtggg 
atgtttaagt tggaggtggc 
cctgtagtcc cagctactcg 
atgctctgta acttctaggc 



aaagtttaac cttagtgact 60 

aacgtgaatt gcaaacaggg 120 

ggagtctgtg tgaggccagg 180 

cccattttcc c 231 



<210> 4 65 
^:211> 231 
<212> DNA 

<213> Homo sapiens 



<400> 465 

catgttgttg tagctgtggt aatgctggct gcatctcaga cagggttaac ttcagctcct 60 

gtggcaaatt agcaacaaat tctgacatca tatttatggt ttctgtatct ttgttgatga 120 

.^iggatggcac aatttttgct tgtgttcata atatactcag attagttcag ctccatcaga 180 

L^aactggag acatgcagga cattagggta gtgttgtagc tctggtaatg a 231 



<210> 466 
2]l> 231 



< 2 1 2> 



DMA 
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<213> Homo sapiens 



<400> 466 

caggtacctc tttccattgg atactgtgct 
ggccttcgaa cagaacttgc cacataccca 
cctgtgcaat caaatattgt ggagaattcc 
aataatggag accagtccca caagatgaca 

<210> 467 
<211> 311 
<212> DNA 

<213> Homo sapiens 



agcaagcatg ctctccgggg tttttttaat 60 
ggtataatag tttctaacat ttgcccagga 120 
ctagctggag aagtcacaaa gactataggc 180 
accagtcgtt gtgtgcggct g 231 



<400> 467 

gtacaccctg gcacagtcca atctgaactg 

tggtggcttt tctccttttt catcaagact 

tgtgccttaa cagaaggtct tgagattcta 

gcatgggtct ctgcccaagc tcgtaatgag 

tgtgacctgc tgggcctccc aatagactaa 
ctgcagcaga c 

<210> 468 

<211> 3112 

<212> DNA 

<213> Homo sapiens 

<400> 468 

cattgtgttg ggagaaaaac agaggggaga 
aagatctgca tggtgggaag gacctgatga 
tggaaggcac tggatgcctg atgatgaagt 
atgggatggc cagagacaca ggagatgagt 
cgaggacttg gaattgcatg gagctggagc 
gtgaatgtgg atgattqqat qatcatttct 
aaatgggata cacagtatga tctataaagt 
atttgaagga tgaattgaga taatttattt 
catttggtgg aactgagaaa tggcataaca 
gattatcatt caatctcata gttttgtcat 
aaattgaact gttaacaaag gaatctctgg 
tttccattcc agttggcttc ttgggtttgc 
gaagttttaa catttctcca gtgatttttt 
gattaaataa agaacttgag aagaacaggt 
aattttctgg atgggcaata cttatgttca 
attaaatggc aatggacaaa gtgaaaaact 
ggatgttcct tagtcactta aaggagaact 
acctgtgaga ttaaggctct ttgtggggaa 
tccaaagcca acgtcgaatt ttgaaacata 
tagtacatct ttcttatggg atgcacttat 
tttagctctc aaaatggttc attttaagag 
ggaaggacag cattgtggct tggactttat 
ataagaaagg ctgctgactt taccatctga 
aacatcacta gaaacagcaa gatgacaata 
atttccagcc cctttaaata tccacacaca 
ctgggagaaa tgcccggccg ccatcttggg 
ccgcttgtga gggaaggaca ttagaaaatg 
aaaacagatc ctgttgtgga tatttatttg 
aaagtgagca ttaccaatga gaggaaaaca 
atgcaacaaa caaaatggaa tactgtgatg 
accacggggc agagggtcag gattctggcc 



gttcggcact catctttcat gagatggatg 60 
cctcagcagg gagcccagac cagcctgcac 120 
agtgggaatc atttcagtga ctgtcatgtg 180 
actatagcaa ggcggctgtg ggacgtcagt 240 
caggcagtgc cagttggacc caagagaaga 300 

311 



tttgtgtggc tgcagccgag ggagaccagg 60 
tacagagttt gataggagac aattaaaggc 120 
ggactttcaa actggggcac tactgaaacg 180 
tggagcaagc tcaataacaa agtggttcaa 240 
tgaagtttag cccaattgtt tactagttga 300 
catctctgag cctcaggttc cccatccata 360 
gggatatagt atgatctact tcactgggtt 420 
caggtgccta gaacaatgcc cagattagta 4 80 
ccaaatttaa tatatgtcag atgttactat 540 
ggcccaattt atcctcactt gtgcctcaac 600 
tcctgggtaa tggctgagca ccactgagca 660 
tagctgcatc actagtcatc ttaaataaat 720 
tatctcacct ttgaagatac tatgttatgt 780 
ttcattaaac ataaaatcaa tgtagacgca 840 
caggaaatgc tttaaaatat gcagaagata 900 
tagacttttt tttttttttt ggaagtatct 960 
gaaaaatagc agtgagttcc acataatcca 1020 
ggacaaagat ctgtaaattt acagtttcct 1080 
tcaaagctct tcttcaagac aaataatcta 1140 
gaaaaatggt ggctgtcaac atctagtcac 1200 
aaagttttag aatctcatat ttattcctgt 1260 
aaggtcttta ttcaactaaa taggtgagaa 1320 
ggccacacat ctgctgaaat ggagataatt 1380 
taatgtctaa gtagtgacat gtttttgcac 1440 
caggaagcac aaaaggaagc acagagatcc 1500 
tcatcgatga gcctcgccct gtgcctggtc 1560 
aattgatgtg ttccttaaag gatgggcagg 1620 
aacgggatta cagatttgaa atgaagtcac 1680 
gacgagaaaa tcttgatggc ttcacaagac 1740 
acatgaggca gccaagctgg ggaggagata 1800 
ctgctgccta aactgtgcgt tcataaccaa 1860 
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at L ttcat atttctaacc ctcaaaacaa agctgttgta atatctgatc tctacggttc 1920 

cttctjggcc caacattctc catatatcca gccacactca tttttaatat ttagtrccca 1980 

gatctgtact gtgacctttc tacactgtag aataacatta ctcatrrtgt tcaaagaccc 2040 

\t.:ai.qttgc tgcctaatar gtagctgact gtttttccta aggagtgttc tggcccaggg 2100 

gatctgtgaa caggctggga agcatctcaa gatctttcca gggttatact tactagcaca 2160 

jagcatgatc attacggagt gaattatcta atcaacatca tcctcagtgt ctttgcccat 2220 

actgaaattc atttcccact tttgtgccca ttctcaagac ctcaaaatgt cattccatta 2280 

r.-'atcacagg attaactttt ttttttaacc tggaagaatt caatgttaca tgcagctatg 2340 

ggaatttaat tacatatttt gttttccagt gcaaagatga ctaagtcctt tatccctccc 2400 

ctttgtttga ttttttttcc agtataaagt taaaatgctt agccttgtac tgaggctgta 2460 

tMcagccaca gcctctcccc atccctccag ccttatctgt catcaccatc aacccctccc 2520 

.-^tgcacctaa acaaaatcta acttgtaatt ccttgaacat gtcaggcata cattattcct 2580 

tctgcctgag aagctcttcc ttgtctctta aatctagaat gatgtaaagt tttgaataag 2640 

ctgactatct tacttcatgc aaagaaggga cacatatgag attcatcatc acatgagaca 2700 

gcaaatacta aaagtgtaat ttgattataa gagtttagat aaatatatga aatgcaagag 2760 

ccacagaggg aatgtttatg gggcacgttt gtaagcctgg gatgtgaagc aaaggcaggg 2820 

aacctcatag tatcttatat aatatacttc atttctctat ctctatcaca atatccaaca 2880 

agcttttcac agaattcatg cagtgcaaat ccccaaaggt aacctttatc catttcatgg 2940 

tgagtgcgct ttagaatttt ggcaaatcat actggtcact tatctcaact ttgagatgtg 3000 

tttgtccttg tagttaattg aaagaaatag ggcactcttg tgagccactt tagggttcac 3060 

tcctggcaat aaagaattta caaagagcaa aaaaaaaaaa aaaaaaaaaa aa 3112 

<210> 4 69 
< 2 1 1 > 222 9 
<212> DNA 

<213> Homo sapiens 
<400> 469 

agctctttgt aaattcttta ttgccaggag tgaaccctaa agtggctcac aagagtgccc 60 
tatttctttc aattaactac aaggacaaac acatctcaaa gttgagataa gtgaccagta 120 
tgatttgcca aaattctaaa gcgcactcac catgaaatgg ataaaggtta cctttgggga 180 
tttgcactgc atgaattctg tgaaaagctt gttggatatt gtgatagaga tagagaaatg 240 
aagtatatta tataagatac tatgaggttc cctgcctttg cttcacatcc caggcttaca 300 
aacgtgcccc ataaacattc cctctgtggc tcttgcattt catatattta tctaaactct 360 
tataatcaaa tacactttta gtatttgctg tctcatgtga tgatgaatct catatgtgtc 420 
ccttctttgc atgaagtaag atagtcaact tattcaaaac tttacatcat tctagattta 480 
agagacaagg aagagcttct caggcagaag gaataatgta tgcctgacat gttcaaggaa 540 
ttacaagtta gattttgttt aggtgcatgg gaggggttga tggtgatgac agataaggct 600 
ggagggatgg ggagaggctg tggctgtata cagcctcagt acaaggctaa gcattttaac 660 
tttatactgg aaaaaaaatc aaacaaaggg gagggataaa ggacttagtc atctttgcac 720 
tggaaaacaa aatatgtaat taaattccca tagctgcatg taacattgaa ttcttccagg 780 
ttaaaaaaaa agttaatcct gtgatattaa tggaatgaca ttttgaggtc ttgagaatgg 840 
gcacaaaagt gggaaatgaa tttcagtatg ggcaaagaca ctgaggatga tgttgattag 900 
ataattcact ccgtaatgat catgctgtgt gctagtaagt ataaccctgg aaagatcttg 960 
agatgcttcc cagcctgttc acagatcccc tgggccagaa cactccttag gaaaaacagt 1020 
cagctacata ttaggcagca acacgaaggg tctttgaaca aaatgagtaa tgttattcta 1080 
nagtgtagaa aggtcacagt acagatctgg gaactaaata ttaaaaatga gtgtggctgg 1140 
atatatggag aatgttgggc ccagaaggaa ccgtagagat cagatattac aacagctttg 1200 
ttttgagggt tagaaatatg aaatgatttg gttatgaacg cacagtttag gcagcagggc 1260 
cagaatcctg accctctgcc ccgtggttat ctcctcccca gcttggctgc ctcatgtcat 1320 
cacagtattc cattttgttt gttgcatgtc ttgtgaagcc atcaagattt tctcgtctgt 1380 
tttcctctca ttggtaatgc tcactttgtg acttcatttc aaatctgtaa tcccgttcaa 1440 
ataaatatcc acaacaggat ctgttttcct gcccatcctt taaggaacac atcaattcat 1500 
Lttctaatgt ccttccctca caagcgggac caggcacagg gcgaggctca tcgatgaccc 1560 
aagatggcgg ccgggcattt ctcccaggga tctctgtgct tccttttgtg cttcctgtgt 1620 
atgtggatat ttaaaggggc tggaaatgtg caaaaacatg tcactactta gacattatat 1680 
rg'catcttg ctgtttctag tgatgttaat tatctccatt tcagcagatg tgtggcctca 1740 
q.ilaataaag tcagcagcct ttcttatttc tcacctggaa atacatacga ccatttgagg 1800 
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agacaaatgg caaggtgtca gcataccctg 
ttggtttccg agcatcacaa acaccctctc 
cttatttcag tgggctgttg gcaggaacaa 
agtgcctgac acacaccatt ctcttgaggt 
ttcttgggga gcagtggctc acacctgtaa 
ggtcacctga ggtcaggagt tcaagaccag 
ctaaaaatac aaaaattagc tgggcgtgct 
aatggaatt 



aacttgagtt gagagctaca cacaatatta 1860 
tgtttcttca ctgggcacag aattttaata 1920 
atgaagcaat ctacataaag tcactagtgc 1980 
cccctctaga gatcccacag gtcatatgac 2040 
tcccagcact ttgggaggct gaggcaggtg 2100 
cctggccaat atggtgaaac cccatctcta 2160 
ggtgcatgcc tgtaatccca gccccaacac 2220 

2229 



<210> 470 

<211> 2426 

<212> DNA 

<213> Homo sapiens 



<400> 470 

gtaaattctt tattgccagg agtgaaccct 
tcaattaact acaaggacaa acacatctca 
caaaattcta aagcgcactc accatgaaat 
gcatgaattc tgtgaaaagc ttgttggata 
tatataagat actatgaggt tccctgcctt 
ccataaacat tccctctgtg gctcttgcat 
aattacactt ttagtatttg ctgtctcatg 
tgcatgaagt aagatagtca acttattcaa 
aggaagagct tctcaggcag aaggaataat 
ttagattttg tttaggtgca tgggaggggt 
tggggagagg ctgtggctgt atacagcctc 
tggaaaaaaa atcaaacaaa ggggagggat 
caaaatatgt aattaaattc ccatagctgc 
aaaaagttaa tcctgtgata ttaatggaat 
aagtgggaaa tgaatttcag tatgggcaaa 
cactccgtaa tgatcatgct gtgtgctagt 
ttcccagcct gttcacagat cccctgggcc 
catattaqgc agcaacacga agggtctttg 
agaaaggtca cagtacagat ctgggaacta 
ggagaatgtt gggcccagaa ggaaccgtag 
gggttagaaa tatgaaatga tttggttatg 
cctgaccctc tgccccgtgg ttatctcctc 
attccatttt gtttgttgca tgtcttgtga 
ctcattggta atgctcactt tgtgacttca 
atccacaaca ggatctgttt tcctgcccat 
atgtccttcc ctcacaagcg ggaccaggca 
gcggccgggc atttctccca gggatctctg 
atatttaaag gggctggaaa tgtgcaaaaa 
cttgctgttt ctagtgatgt taattatctc 
aaagtcagca gcctttctta tttctcacct 
atggcaaggt gtcagcatac cctgaacttg 
tccgagcatc acaaacaccc tctctgtttc 
tcagtgggct gttggcagga acaaatgaag 
tgacacacac cattctcttg aggtcccctc 
gggagcagtg gctcacacct gtaatcccag 
ctgaggtcag gagttcaaga ccagcctggc 
atacaaaaat tagctgggcg tgctggtgca 
aggcaggaga attgctggaa catgggaggc 
tgcactcgaa cctgggcgac agagtggaac 
ggcatagtca gatacaacgt gggtgggatg 
gggtgacggt tttgcccaac acaatg 



aaagtggctc acaagagtgc cctatttctt 60 
aagttgagat aagtgaccag tatgatttgc 120 
ggataaaggt tacctttggg gatttgcact 180 
ttgtgataga gatagagaaa tgaagtatat 240 
tgcttcacat cccaggctta caaacgtgcc 300 
ttcatatatt tatctaaact cttataatca 360 
tgatgatgaa tctcatatgt gtcccttctt 420 
aactttacat cattctagat ttaagagaca 480 
gtatgcctga catgttcaag gaattacaag 54 0 
tgatggtgat gacagataag gctggaggga 600 
agtacaaggc taagcatttt aactttatac 660 
aaaggactta gtcatctttg cactggaaaa 720 
atgtaacatt gaattcttcc aggttaaaaa 780 
gacattttga ggtcttgaga atgggcacaa 84 0 
gacactgagg atgatgttga ttagataatt 900 
aagtataacc ctggaaagat cttgagatgc 960 
agaacactcc ttaggaaaaa cagtcagcta 1020 
aacaaaatga gtaatgttat tctacagtgt 1080 
aatattaaaa atgagtgtgg ctggatatat 1140 
agatcagata ttacaacagc tttgttttga 1200 
aacgcacagt ttaggcagca gggccagaat 1260 
cccagcttgg ctgcctcatg tcatcacagt 1320 
agccatcaag attttctcgt ctgttttcct 1380 
tttcaaatct gtaatcccgt tcaaataaat 1440 
cctttaagga acacatcaat tcattttcta 1500 
cagggcgagg ctcatcgatg acccaagatg 1560 
tgcttccttt tgtgcttcct gtgtgtgtgg 1620 
catgtcacta cttagacatt atattgtcat 1680 
catttcagca gatgtgtggc ctcagatggt 1740 
ggaaatacat acgaccattt gaggagacaa 1800 
agttgagagc tacacacaat attattggtt 1860 
ttcactgggc acagaatttt aatacttatt 1920 
caatctacat aaagtcacta gtgcagtgcc 1980 
tagagatccc acaggtcata tgacttcttg 2040 
cactttggga ggctgaggca ggtgggtcac 2100 
caatatggtg aaaccccatc tctactaaaa 2160 
tgcctgtaat cccagctact tgggaggctg 2220' 
ggaggttgca gtgagctgta attgtgccat 2280 
tctgtttcca aaaaacaaac aaacaaaaaa 2340 
tgtaaataga agcaggatat aaagggcatg 2400 

2426 



<210> 471 
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< :. '\ K 812 

■:.Z12> DNA 

<213> Homo sapiens 



<400> 471 

gaacaaaatg agtaatgtta ttctacagtg tagaaaggtc acagtacaga tctgggaact 60 

aaatattaaa aatgagtgtg gctggatata tggagaatgt tgggcccaga aggaaccgta 120 

gat cagat attacaacag ctttgttttg agggttagaa atatgaaatg atrtggttat 180 

gaacgcacag tttaggcagc agggccagaa tcctgaccct ctgccccgtg gttatctcct 240 

::cccagcttg gctgcctcat gtcatcacag tattccattt tgtttgttgc atgtcttgtg 300 

logccatcaa gattttctcg tctgttttcc tctcattggt aatgctcact ttgtgacttc 360 

■Tifutcaaatc tgtaatcccg ttcaaataaa tatccacaac aggatctgtt ttcctgccca 420 

tcctttaagg aacacatcaa ttcattttct aatgtccttc cctcacaagc gggaccaggc 480 

acagggcgag gctcatcgat gacccaagat ggcggccggg catttctccc agggatctct 540 

gtgcttcctt ttgtgcttcc tgtgtgtgtg gatatttaaa ggggctggaa atgtgcaaaa 600 

acatgtcact acttagacat tatattgtca tcttgctgtt tctagtgatg ttaattatct 660 

ccatttcagc agatgtgtgg cctcagatgg taaagtcagc agcctttctt atttctcacc 720 

tctgtatcat caggtccttc ccaccatgca gatcttcctg gtctccctcg gctgcagcca 780 
cacaaatctc ccctctgttt ttctgatgcc ag 812 

<210> 472 
<2ai> 515 
<2:i2> DNA 
<213> Homo sapiens 



<220> 

<221> raisc_f eature 
<222> {!)... (515) 
<223> n - A,T,C or G 



<400> 472 

acggagactt attttctgat 
cttatgactt tcctatcatg 
gttccagaat tattggtcct 
caatcaggat attgaacctg 
agtagaaggt gattgccagg 
ctctgatgta aaagagaaga 
agatgggcag ccataagtta 
cattgaaaat gtgactgaaa 
gaaaaaaaaa naaaaaaaaa 



attgtctgca tatgtatgtt 
cttattaata aataatacag 
tgcagcccgg tgaatctcag 
gacaagagag agaaggaaca 
aaatggatct ggaaaagact 
ctccacctaa tcctaagcat 
aaaagaagac aagctgaagc 
atttgaaaat tctctcaata 
aaanaaaaan aaaaa 



tttaagagtc tggaaatagt 60 
cccagagaag atgaaaatgg 120 
caagaggaac caccaactga 180 
cctccgatcg aagaacgtaa 240 
cggagtgagc gtggagatgg 300 
gctaagacta aagaagcagg 360 
tacacacatg gctgatgtca 420 
aagtttgagt tttctctgaa 480 

515 



<210> 473 
<211> 5829 
<212> DNA 

<213> Homo sapiens 



<400> 473 

cgcatgccgg ggaagcccaa gctggctcga agagccacca gccacctgtg caagggtggg 60 

cctggaccag ttggaccagc caccaagctc acctactcaa ggaagcaggg atggccaggt 120 

tgcaacagcc tgagtggctg ccacctgata gctgatggag cagaggcctg aggaaaatca 180 

gatggcacat ttagctcttt aatggatctt aagttaattt ttctataaag cacatggcac 240 

cagtccatgc ctcagagctc gtatggcact gcggaccaca gcaggccgag ttcccaggat 300 

Lgccatccag gggggccttc tgtagccctg gccagacctt gcagaggtgg ctgggtgctc 360 

tttgagcgag ctcggcctcc ctggcatgca caggccccag gtactgacac gctgctctga 420 

gtgagcttgt cctgccttgg ctgccaccta actgctgatg gagcagcggc cttaggaaaa 480 

acaaatggcg ctgtagccca actttagggt agaagaagat gtaccatgtc cggccgctag 540 

:'ggtgactg gtgcacctgc tcctggcgta cccttgcaga ggtgggtggt tgctctttgg 600 

:.'':agctt ggc cttgcctggc atgcacaagc ctcagtgcaa caactgtcct acaaatggag 660 
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acacagagag gaaacaagca gcgggctcag 
agtccatgcc tcgggtcgta tggtactgca 
ggccttcttg aggaggactt tacgttcaag 
agactaagcc ttctgtggcc ctggcgagac 
actcggagca gcgtgtcagt agctggggcc 
aaggagcaac cagccacctc tgcaagggtg 
cacccgctcg aggaagtggg gatggccagg 
tgctgatgga gcagaggcct taagaaaagc 
gaagaagtga tgtacatgtc cggacgctaa 
cctttgcaga ggtggctggt tgctctttga 
tttcagtgca acaactttgc cacaaatgga 
ggagaagggt gtaccctgcc tttggggctc 
gcgggcttct tggttgccag gaggcggacc 
aagtgcagaa agcagccagg attgccatcc 
ttgcaggggt gtctggttgc tctttgagcc 
aggtgctggc acgctgctcc gagtgtgctt 
tgcgtctgga gggggtggac cggccaccaa 
gttcccacag cctgagtggc tgccacctga 
cagatggccc ttggccctac ctttttgtta 
gtgaggttgg tggctgtgcc cccagctcct 
ctttgggccc tcttggcctt gcccagcatg 
aatggagcca tataggggaa acgagcagcc 
ggggctccag tccttgcctc aagggtctta 
gcagaccata ggccgtcttg agagggactt 
gccaccctcg ggactctgcc ttctgtggcc 
ggacaggctc acttggagta gcgtgtcagt 
gggctggctc ggggagcaac cagccacctc 
gccaccagct cacccactcc aggaagccgg 
gccacctaat ggctgaagaa acagaggcct 
ctttatggta gaagagctga tttagcctga 
gcctgctgct ggcgcatccg tgcaaggatg 
gcccggcatg cgcaagcctc agtgcaacaa 
aggagcagct ggctctggag catggtgtgc 
tgggtcgtat gacactgcgg gcttgttggt 
ggaggacttt atgttccagt ccagaaagca 
tctgtgccca ggccagacgt agaatttgac 
tgtcggtccc cggggcctgt gcatgccggg 
ccacctctgt taagggtgtg cctggagcag 
agaagcaggg atggccaggt tccaacatcc 
cagaggcctg aggaaaagca gatggcactg 
tctttttcag ttaatgtctg ttttatcaga 
ctttccattt gcttggtaaa tattcctcca 
gcacatgaga tgggtctcct gaatacagga 
tgccagtctg tgtcttttaa ctggggcatt 
tacatgtgaa atttatcctg tcatgatgtt 
agtttcttta tagtgtcaat ggtctttaca 
ggtttttcct ttctacgttt agtgtctcct 
tttatttctt gtaaggtaaa tatgtggatt 
accccaagaa tcattacttt ttaaaatgca 
atggaaaggc ttgtggcatt agaatcctta 
tatggtcttt gtcatcgagg cagaagtgcc 
aactcttgag tccaagccat ccttttgcct 
taaggcatca tgcgtggcta attttttcac 
tgtcactgtg ttgctctggc tgatctcaaa 
gcctcctaaa gtgctaggat tatatgcatg 
tattttcaaa gtcttattgt aagagccatt 
atatctctga aacttttttt tgacaaattt 
aaactttcta ataagagtta acttagagcc 



gagcagggtg tgtgctgcct ttggggctcc 720 
ggcttcttgg ttgccaagag gcggaccaca 780 
tgcagaaagc agccaaaatt accatccatg 840 
ttaaaatttg tgccaaggca ggacaagctc 900 
tatgcatgcc gggcagggcc gggctggctg 960 
cgcctagtgc aggcggagca tccaccacct 1020 
ttcccacagc ctgagtgtct gccaccttat 1080 
agatggcact gtggccctac ctttagggtg 1140 
ttggtgactg gtacaccggc tcctgctaca 1200 
gccagcttgt ccttgcccgg catgcacaag 1260 
gccatataga ggaaacaaga agcaggttca 1320 
cagtccatgc ctcaggtgtc acatggcact 1380 
acaggccatc ttggggagga ctttgtgttc 1440 
agggggacct tctatagccc tggccaaacc 1500 
ggcttggcct ccctggcatg cacgggcccc 1560 
gtcctgcctt ggctgccacc tctgcggggg 1620 
ccttacccag tcaaggaagt ggatggccat 1680 
tggctgatgg agcaaaggcc ttaggaaaag 1740 
gaagaactga tgttccatgt cctgcagcga 1800 
ggcgcgccct cgcagaggtg actggttgct 1860 
cacaagcctc agtgctacta ctgtgctaca 1920 
atctcaggag caaggtgtat gctgcctttg 1980 
tgtcactgtg ggcttcttgg ttgtcaagag 2040 
tatgttcaag tgcagaaagc agccaggatt 2100 
ctggccaaac ttagaatttg gccgtagaca 2160 
agctggggtc tgtgcatgcc gggcaaggcc 2220 
tgcgggggtg cgcctggagc aggtggagca 2280 
ggtagccagg ttcccaaggc ctgagtgggt 234 0 
tgggaaaacc agatggcact gtggccctac 2400 
ctggcagcgt gtggggttgg tggctggtct 24 60 
gctggttgcc ctttgagcca gcttgccctt 2520 
ctgtgctgca aatggggcca tatagaggaa 2580 
actccctttg ggccttcagt ccatgtctca 2640 
tgccaagagg cagaccacag gtcatcttga 2700 
gccagtggta ccacccaggg gacttgtgct 2760 
aaagtcagga cggtctcagt cagagcggcg 2820 
cagggccggg ctggcttggg gagcaagcag 2880 
gtggagcagc caccaacctc acgcactgaa 2940 
tgagtggctg ccacctgatg gctgatggag 3000 
ctttgtagtg ctgttctttg tctctcttga 3060 
gactaggatt gcaaaccctg ctcttttttg 3120 
tccctttatt ttaagcctat gtgtgtcttt 3180 
caacaatggg tctttactct ttatccaact 324 0 
tagcccattt acatttaagt ttagtattgt 3300 
gctagctttt tatttttccc attagtttgc 3360 
attcgatatg tttttgtagt ggctggtact 3420 
tcaggagctc ttgtaacaca agaatgtgga 34 80 
tatttcttgg gactgtattc tatggccttt 3540 
attcaaatta gcataaaaca tttacagcct 3600 
tttataggat tattttgtgt ttttttgaga 3660 
gtggtttgat cataattcac cacagccctg 3720 
taatctccca accagttgga tctgcaggca 3780 
gttttttttt tttttttgtc gagattatgg 3840 
tgtttgacct caagggatct ttctgccacg 3900 
atacaccatg cctattgtag agtattacat 3960 
tattgccttt ggcctaaata actcaatata 4020 
tggggcgtga tgatgagaga agggggtttg 4080 
atttaagaaa ggaaaaaaca caaattatca 4140 
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gaaaaacaac agtaagatca agtgcaaaag 
tatatgtttg tgactcatgg tggcttttac 
cattraaaga atctacatta tagataacat 
qttattggtt ttgtatgaga ttattctcag 
Ltaatgtagt tcgatgatct tacagcaaag 
ttgactaaaa agttattcaa caggaqttat 
aaaaacttga ggataaaaag atgttggaaa 
jaactacaca atggtgaaga cacattggtg 
aaagggcaat gcaggcaata gaaaaattag 
atcaggtaag tttatttata atttgctttc 
algtatatat attgtgcttc ctctgcctgt 
gaaaaaggtg gcatgtgttt ttactttcaa 
tcaatttttc agagtaatga ttctcactgt 
ataagattac atcctctgac tataaaaatc 
gacattgcac ttggcaatga atttatgggt 
tgagttttta tttccagata catgaaatca 
gtttggaaac cgtctttttg tagaatctgc 
tgctgaaaaa agaaatatct tcactacatg 
caccctgtgg aattccatac ggtgcataga 
acttaggtca tgggccacct ggctgatagc 
tggatgaccg ggccaccctc caccactgcc 
gttcttttca tgtggtcatc ccctgttaga 
accttctgtg tgaacattcc acggcagact 
atgaatgaat atgagagaaa atgaataaat 
tgaggatggt gggtagagga gggtctgttt 
^-ggggcagga gcacaggctt tgaatgcaga 
ttgtgattgt gtgaccttgt gaaagttact 
aaaatggaga taataagatg tcaaaggact 
aaaaaaaaa 

<210> 474 
<211> 1594 
<^212> DNA 

<213> Homo sapiens 
<400> 474 

atttatggat cattaatgcc tctttagtag 
agaataagct tcttgatttg taaaattcta 
aaaatataaa tatatagaca tctcagataa 
aataatagta cttaactaga tgagaataac 
cgtttttcat ttgttgtgtt actcatgttt 
gttttcagaa ttattgtatg cagtcagtat 
tttcacactt ctattactag aaataagaat 
ttcaataaaa ttttttagta aatctgattg 
tcaatattgt tatgtcctag acaccttatc 
Lgcaaatgtg taaaatatca atactttatg 
gctgagagag aaagatgcta gctccgcaag 
cggacacacc gccacgtgga cacatgacca 
accacatgga gacaccgtca cacagtcaca 
cacacagaca tatggagaaa tcacatggac 
acacggagac atcaccacat ggacacactg 
actgtcacat ggacacacca tcacacacat 
tggcacacac actgccacac tgtcacatgg 
acactgcctg tggacacaag gacacacaga 
acacggagac atcaccatgc agatacacca 
ggggggacag cagtggcata ctcatgccta 
ccctccatca acactgccca ccccaggttg 
ggcaaggtga actaatggag tgggtggagg 



ttctgtggca aagatgatga gagtaaagaa 4200 
tttgttcttg aatttctgag tacgggttaa 4260 
tttatugcaa gtaaatgtat ttcaaaattt 4320 
cctacttcat tatcaagcta tattatttta 4380 
ctgaaagctg tatcttcaaa atatgtctat 4440 
tarctataaa aaaaatacaa caggaatata 4500 
aagtaatatt aaatcttaaa aaacatatgg 4560 
aagtacaaaa atataaattg gatctagaag 4620 
tagaaatccc tttaaaggtt agtttgtaaa 4680 
atttatttca ctgcaaatta tattttggat 4740 
cttacagcaa tttgccttgc agagttctag 4800 
aatatttaaa tttccatcat tataacaaaa 4860 
ggagtcattt gattattaag acccgttggc 4920 
ctggaagaaa acctaggaaa tattcgtctg 4980 
aaccactgat ccacttccag tcactatcca 5040 
tatgagttga aactttcttt tgattgagca 5100 
aagtggatat ttggaaccct ttgaggccta 5160 
atgaccacca gcagcagctg gggaaaccag 5220 
atacatcctc ccttcagtcg gcttgggtca 5280 
agtttccaca gaaatgcttc aagatgaaag 5340 
ctgtaagacG atgggacaca caggccacca 5400 
tgggagaaaa tacacctgcc tcatttttgt 5460 
gtcgctaaat gtggatgaag aattgaatga 5520 
ggttcagatc ctgggctgga aggctgtgta 5580 
ttcttgcctt taagtcacta attgtcactt 5640 
ccgactggac tttaattctg gctttactag 5700 
taaaccctct gtgcctgttt ctttatctgt 5760 
gtggtaagaa ttaaatgctt taaaaaaaaa 5820 

5829 



tttagagaaa acgtcaaaag aaatggcccc 60 
tgtcattggc tcaaatttgt atagtatctc 120 
tatatttgaa atagcaaatt cctgttagaa 180 
aggtcgccat tatttgaatt gtctcctatt 240 
tacttatgag ggatatatat aacttccact 300 
gagaatgcaa tttaagtttc cttgatgctt 360 
acagtaatat tggcaaagaa aattgaccag 420 
aaaataaaca ttgcttatgg ctttcttaca 480 
tgaaattacg gcttcaaaat tctaattatg 540 
ttcaagctgg ggcctcttca ggcgtcctgg 600 
ccggagaggg aacaccgcca cattgttaca 660 
gactcacatg tacagacaca cggagacatt 720 
cggacacact ggcatagtca catggacgga 780 
acaccaccac actatcacag ggacacagac 840 
tcacactacc acagggacac gagacatcac 900 
gaacacaccg acacactgcc atatgqacac 960 
acacacctcc acaccatcac accaccacac 1020 
cactgtcaca cagatacaca aaacactgtc 1080 
ccactctggt gccgtctgaa ttaccctgct 1140 
agtgactggc tttcacccca gtagtgattg 1200 
gggctacccc agcccatctt tacaaaacag 1260 
agttggaaga aatcccagcg tcagt^caccg 1320 
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ggatagaatt cccaaggaac cctctttttg 
gccgacaggg tgaatgcctt cttgcttgtc 
ggtgggaaga gtgtggctgt gtactttgaa 
aaaccaacag ttttgtgaat gtgtctggag 
gaagggaaga ggcctggggc tggagtattc 



gaggatggtt tccatttctg gaggcgatct 1380 
ttctggggaa tcagagagag tccgttttgt 1440 
ctcctgtaaa ttctctgact catgtccaca 1500 
gcaagggaag ggccactcag gatctatgtt 1560 
gctt 1594 



<210> 475 
<2ai> 2414 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> unsure 

<222> {33} 

<223> n-A,T,C or G 



<400> 475 

Gccaacacaa tg^ictttata agaatgcttc 
tgtagattat ttttaaggac aaatctttat 
agctaatatt tcatgctgaa cacattttaa 
tatcattaat qcctctttag tagtttagag 
gcttcttgat ttqtaaaatt ctatgtcatt 
aaatatataq acatctcaga t.aatatattt 
gtacttaact agatgagaat aacaggtcgc 
catttgttgt gttactcatg ttttacttat 
agaagtattg tatgcagtca gtatgagaat 
acttctatta ctagaaataa gaatacagta 
aaaatttttt agtaaatctg attgaaaata 
ttgttatgtc ctagacacct tatctgaaat 
tgtgtaaaat atcaatactt tatgttcaag 
agagaaagat gctagctccg caagccgggg 
caccgccacg tggacacatg accagactca 
tggagacacc gtcacacagt cacacgagca 
gacatatgga gaaatcacac tgacacacca 
agacatcacc acatggacac actgtcacac 
acatggacac accatcacac acatgaacac 
acacactgcc acactgtcac atggacacac 
ccatgtggac acaaggacac acagacactg 
gagacatcac catgcagata caccaccaca 
gatacaccac cacacagaaa tgcggacaca 
acactttcat gtgtcagctg gcggtgtggg 
tacttggaca tatagtgaag gcaaaatttt 
ctgtctcaaa aaaagaaaaa aaaagcaata 
tattatgtag atcggagagc agaggattct 
tacattactc ataattactg atgaactaaa 
attgacctgg agaaatgtag acacaaaaga 
ctataagggc atgcaagaat aaaaataggg 
ctggtcatgt aagtcaactt gtatcggtta 
actgcacata cttcaattgt acattttggt 
agcaacatat taaaatagca aacatatcca 
ttttctactc actttttcct gcctatcccc 
cttccagtca ctatccatga gtttttattt 
tttttcctgt tggagcccaa ggagcaaggg 
cagttactca tgacgtctcc atccaggact 
ggggcatcct tctccatcca gtattggggg 
tcctcctcca tccaggacct gaggggtgtc 
tcccttcatg tttatagtra cttaccatta 
aaaaaaaaaa aaaa 



acntgtgaaa aacaaatatc aaagtcttct 60 
tccatgttta atttatttag ctttccctgt 120 
atgctgtaaa tgtagataat gtaatttatg 180 
aaaacgtcaa aagaaatggc cccagaataa 240 
ggctcaaatt tgtatagtat ctcaaaatat 300 
gaaatagcaa attcctgtta gaaaataata 360 
cattatttga attgtctcct attcgttttt 420 
ggggggatat atataacttc cgctgttttc 480 
gcaatttaag tttccttgat gctttttcac 540 
atattggcaa agaaaattga ccagttcaat 600 
aacattgctt atggctttct tacatcaata 660 
tacggcttca aaattctaat tatgtgcaaa 720 
ctggggcctc ttcaggcgtc ctgggctgag 780 
agggaacacc gccacattgt tacatggaca 840 
catgtacaga cacacggaga cattaccaca 900 
cactggcata gtcacatgga cggacacaca 960 
ccacactatc acagggacac agacacacgg 1020 
taccacaggg acacgagaca tcacactgtc 1080 
accgacacac tgccatatgg acactgccac 1140 
ctccatacca tcacaccacc acacacactg 1200 
tcacacagat acacaaaaca ctgtcacacg 1260 
tggacatagc accagacact ctgccacaca 1320 
ctgccacaca gacaccacca catcgttgcc 1380 
ccccacgact ctgggctcta atcgagaaat 144 0 
tttttatttt ctgggtaacc aagcgcgact 1500 
tactgtgtaa tcgttgacag cataattcac 1560 
gaatgcatga acatatcatt aacatttcaa 1620 
gagaaaccaa gaaattatgg tgatagttat 1680 
accgtaagat gagaaatgtg ttaacacagt 1740 
gagaaaacag gagagttttt caagagcttt 1800 
atttttaaaa ggtttattta catgcaataa 1860 
aattcttggc atttgtagct ctataaaacc 1920 
ttacctttac caccaaagtt ttcttgtgtt 1980 
ccatctcttc cacaggtaac cactgatcca 2040 
ccaaatacat gaaatcatat gaatttctgg 2100 
cagaatgagg aacatgatgt ttcttwccga 2160 
gaggggggca tccttctcca tctaggactg 2220 
tcatccttct ccatccagta ttgggggtca 2280 
cttttctgcg cttccttgga tggcagtctt 234 0 
aatcactgtg ccgttttttc ctaaaataaa 2400 

2414 
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Z10:> 476 

i'' 3 4 3 4 
:::i2> DNA 

213> Homo sapiens 



<^0 0> ^76 

gi qctgca 
^ct ccctttg 
+■ ^j^'caagagg 
gc^^-.^iotagta 
c a a g t c a g g 
gcagggccag 
qgt gqagcag 
ctgagtggct 
get ttgtagt 
agact aggat 

acaacaat gg 
ttagcccatt 
t tgctagctt 
caattcgata 
c! t caggagc 
^-tt attctgg 
att caaatta 
Lttataggat 
gtggtttgat 
t aatctccca 
gttttttttt 
tttgacctca 
acaccatgcc 
ttgcctttgg 
qqgcgtgatg 
\ taagaaagg 
ctgt ggcaaa 
tgttcttgaa 
tattgcaagt 
t: acttcatta 
gaaagct gta 
tctataaaaa 
taatat taaa 
tacaaaaata 
aaatcccttt 
tatttcactg 
acagcaatt t 
atttaaattt 
gtcatttgat 
gaagaaaacc 
ttggaatcct 
ct aggatgaa 
aacttttata 
atgcaat caa 
qaggt t atgg 
■ t.cccata 
aacaaatatc 
-li.V Latttag 
u t . igataat 
aaociaatggc 



aatggggcca 
ggccttcagt 
cagaccacag 
ccacccaggg 
acggtct cag 
gctggcttaa 
ccaccaacct 
gccacctgat 
gctgttcttt 
tgcaaaccct 
tttaagccta 
gt ctttactc 
tacatttaag 
tttatttttc 
tgtttttgta 
tcttgtaaca 
gactgtattc 
gcataaaaca 
tattttgtgt 
cataattcac 
accagttgga 
tttttgtcga 
agggatcttt 
tattgtagag 
cctaaataac 
atgagagaag 
aaaaaacaca 
gatgatgaga 
tttctgagta 
aaatgtattt 
tcaagctata 
tcttcaaaat 
aatacaacag 
tcttaaaaaa 
taaattggat 
aaaggt tagt 
caaatt atat 
gccttgcaga 
ccatcattat 
t attaagacc 
taggaaatat 
gcagat ataa 
gtatattgtt 
ctgatacaca 
gaatctcagc 
agtctgtagc 
gtttccccaa 
aaagtcttct 
ctttccctgt 
gtaat ttatg 
cccagaataa 



tat agaggaa 
coat gtctca 
gtcatcttga 
gacttgtgct 
tcagagcagc 
ggagcaagca 
cacgcactga 
ggctgatgga 
gtctctcttg 
gctctttttt 
tgtgtgtctt 
tt tat ccaac 
tttagtattt 
ccatt agtt t 
gtggctggta 
caagaatgtg 
tatggccttt 
tttacagcct 
ttttttgaga 
cacagccctg 
tct acaagca 
gattatggta 
ctgccacagc 
tattacatt a 
tcaatat aat 
ggggtttgaa 
aattatcaga 
gtaaagaata 
cgggttaaca 
caaaatttgt 
ttattttatt 
atgt ct attt 
gaatataaaa 
cat atggaaa 
ctagaagaaa 
ttgtaaaatc 
ttt ggat atg 
gttctaggaa 
aacaaaatca 
cgtt ggcata 
tcgt ct ggac 
taatgataat 
actgtgcttt 
gacactaaaa 
cactcat ttc 
tt caggtaag 
aaaggttatg 
tgt agattat 
agctaatatt 
tat cattaat 
gcttcttgat 



aggagcagct 
tgggtcgtat 
ggaggacttt 
tctgtggccc 
atgt cggtcc 
gccacctctg 
aagaagcagg 
gcagaggcct 
atctttttca 
gctttccatt 
tgcacatgag 
ttgccagt ct 
gttacatgtg 
gcagtttctt 
ctggtttttc 
gatttatttc 
accccaagaa 
atggaaaggc 
tatggtcttt 
aactcttgag 
taaggcatca 
tcactgtgtt 
ctcctaaagt 
ttttcaaagt 
atct ctgaaa 
actttctaat 
aaaacaacag 
tatgtttgtg 
tttaaagaat 
tattggtttt 
aatgtagttc 
gactaaaaag 
aacttgagga 
ctacacaatg 
gggcaatgca 
aggtaagttt 
tatatat att 
aaaggtggca 
atttttcaga 
agattacat c 
attgcacttg 
taaacaaaac 
gggat taaaa 
agggaaaggg 
tgt aggggct 
atacttaaaa 
acactttata 
ttttaaggac 
tcatgctgaa 
gcctctttag 
ttgtaaaatt 



ggct ctggag 
gacactgcgg 
atgttccagt 
aggccagacg 
ccggggcctg 
ttaggggtgt 
gatggccagg 
gaggaaaagc 
gttaatgtct 
tgcttggtaa 
atgggtctcc 
gtgt cttt ta 
aaatttat cc 
tatagtgtca 
ctttctacgt 
ttgtaaggta 
tcattacttt 
ttgtggcatt 
gtcat cgagg 
tccaagccat 
tgcgtggcta 
gctctggctg 
gctaggatta 
cttattgtaa 
cttttttttg 
aagagttaac 
taagat caag 
actcatggtg 
ctacattata 
gtatgagatt 
gatgatctta 
ttattcaaca 
taaaaagatg 
gtgaagacac 
ggcaatagaa 
atttataatt 
gtgcttcctc 
tgrgttttta 
gtaatgattc 
ctctgactat 
gcaatgaatt 
actcagagaa 
taagtaacta 
tttagatgag 
gcaggagct c 
cccttcagag 
agaatgcttc 
aaat ctttat 
cacattttaa 
tagtttagag 
ctatgtcatt 



catggtgtgc 
gcttgttggt 
ccagaaagca 
tagaatttga 
tgcatgccgg 
gcctggagca 
ttccaacatc 
agatggcact 
gttttatcag 
atattcctcc 
tgaatacagg 
actggggcat 
tgtcatgatg 
atggt cttta 
ttagtgtctc 
aata tgtgga 
ttaaaatgca 
agaatcctta 
cagaagtgcc 
ccttttgcct 
attttttcac 
atctcaaatg 
tatgcatgat 
gagccattta 
acaaattttg 
ttagagccat 
tgcaaaagtt 
gcttttactt 
gataacattt 
attctcagcc 
cagcaaagct 
ggagttatta 
ttggaaaaag 
attggtgaag 
aaattagtag 
tgctttcatt 
tgcctgtctt 
ctttcaaaat 
tcactgtgga 
aaaaatcctg 
tatgggcgct 
actgccaacc 
cagtttatag 
aagctctgct 
cctgtaaaga 
tttct ccatt 
acttgtgaaa 
tccatgttta 
atgctgtaaa 
aaaacgtcaa 
ggctcaaatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 
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tgtatagtat 
attcctgtta 
attgtctcct 
atataacttc 
tttccttgat 
agaaaattga 
aaaaaaaaaa 



ctcaaaatat 
gaaaataata 
attcgttttt 
cgctgttttc 
gctttttcac 
ccagttcaat 
aaaa 



aaatatatag 
gtacttaact 
catttgttgt 
agaagtattg 
acttctatta 
aaaatttttt 



acatctcaga 
agatgagaat 
gttactcatg 
tatgcagtca 
ctagaaataa 
agtaaatctg 



taatatattt 
aacaggtcgc 
ttttacttat 
gtatgagaat 
gaatacagta 
attgaaaata 



gaaatagcaa 3120 
cattatttga 3180 
ggggggatat 3240 
gcaatttaag 3300 
atattggcaa 3360 
aaaaaaaaaa 3420 
3434 



<210> 477 
<211> 140 
<212> PRT 

<213> Homo sapiens 
<400> 477 

Met Asp Gly His Thr Asp lie Trp Arg Asn His Met Asp Thr Pro Pro 
5 10 15 

His Tyr His Arg Asp Thr Asp Thr Arg Arg His His His Met Asp Thr 
20 25 30 

Leu Ser His Tyr His Arg Asp Thr Arg His His Thr Val Thr Trp Thr 
35 40 45 

His His His Thr His Glu His Thr Asp Thr Leu Pro Tyr Gly His Trp 
50 55 60 

His Thr His Cys His Thr Val Thr Trp Thr His Leu His Thr lie Thr 

70 75 80 

Pro Pro His Thr Leu Pro Val Asp Thr Arg Thr His Arg His Cys His 
85 90 95 

Thr Asp Thr Gin Asn Thr Val Thr Arg Arq His His His Ala Asd Thr 
100 105 110 

Pro Pro Leu Trp Cys Arg Leu Asn Tyr Pro Ala Gly Gly Thr Ala Val 
il5 120 125 

Ala Tyr Ser Cys Leu Ser Asp Trp Leu Ser Pro Gin 
130 135 140 



<210> 478 
<211> 143 
<212> PRT 

<213> Homo sapiens 



<400> 478 

Met Tyr Arg His 



Ser His Gly His 
20 

Gly Glu lie Thr 
35 

His Gly Asp lie 



Thr Glu Thr Leu 
5 

Thr Gly He Val 

Trp Thr His His 
40 

Thr Thr Trp Thr 



Pro His Gly Asp 
10 

Thr Trp Thr Asp 

25 

His Thr lie Thr 
His Cys His Thr 



Thr Val Thr Gin 
15 

Thr Gin Thr Tyr 
30 

Gly Thr Gin Thr 
45 

Thr Thr Gly Thr 
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50 55 60 

Ara Asp lie Thr Leu Ser His Gly His Thr He Thr His Met Asn Thr 

65 7 0 7 5 30 

Pre Thr His Cys His Met Asp Thr Gly Thr His Thr Ala Thr Leu Ser 
85 90 95 

His Gly His Thr Ser Thr Pro Ser His His His Thr His Cys Leu Trp 
100 105 110 

Thr Gin Gly His Thr Asp Thr Val Thr Gin He His Lys Thr Leu Ser 
115 120 125 

hjs Gly Asp He Thr Met Gin He His His His Ser Gly Ala Val 
130 135 140 



<210> 419 
<211> 222 
<212> PRT 

<213> Homo sapiens 
<400> 479 

Met Tyr Arg His Thr Glu Thr Leu Pro His Gly Asp Thr Val Thr Gin 
5 10 15 

Ser His Glu His Thr Gly He Val Thr Trp Thr Asp Thr Gin Thr Tyr 
20 25 30 

Gly Glu He Thr Leu Thr His His His Thr He Thr Gly Thr Gin Thr 
35 40 45 

HJs Gly Asp He Thr Thr Trp Thr His Cys His Thr Thr Thr Gly Thr 
50 55 60 

Arg Asp He Thr Leu Ser His Gly His Thr He Thr His Met Asn Thr 
65 70 75 80 

Pro Thr His Cys His Met Asp Thr Ala Thr His Thr Ala Thr Leu Ser 
85 90 95 

His Gly His Thr Ser He Pro Ser His His His Thr His Cys His Val 
100 105 110 

Asp Thr Arg Thr His Arg His Cys His Thr Asp Thr Gin Asn Thr Val 
115 120 125 

Thr Arg Arg His His His Ala Asp Thr Pro Pro His Gly His Ser Thr 
130 135 140 

Arg His Ser Ala Thr Gin He His His His Thr Glu Met Arg Thr His 
145 150 155 160 

Cys His Thr Asp Thr Thr Thr Ser Leu Pro His Phe His Val Ser Ala 
165 170 175 

Gly Gly Val Gly Pro Thr Thr Leu Gly Ser Asn Arg Glu He Thr Trp 
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180 



185 



190 



Thr Tyr Ser Glu Gly Lys lie Phe Phe Tyr Phe Leu Gly Asn Gin Ala 
195 200 205 



Arg Leu Cys Leu Lys Lys Arg Lys Lys Lys Gin Tyr Thr Val 

210 215 220 



<210> 480 

<211> 144 

<212> PRT 

<213> Homo sapiens 

<400> 480 

Met Glu Pro Tyr Arg 

5 

Cys Cys Leu Trp Gly 
20 

Val Gly Phe Leu Val 

35 

Asp Phe Met Phe Lys 
50 

Leu Cys Leu Leu Trp 
65 

Asp Arg Leu Thr Trp 
85 

Arg Ala Arg Pro Gly 
100 

Val Arg Leu Glu Gin 
115 

Ala Gly Val Ala Arg 
130 



Gly Asn Glu Gin Pro Ser 
10 

Leu Gin Ser Leu Pro Gin 
25 

Val Lys Arg Gin Thr lie 
40 

Cys Arg Lys Gin Pro Gly 
55 

Pro Trp Pro Asn Leu Glu 
70 75 

Ser Ser Val Ser Val Ala 
90 

Trp Leu Gly Glu Gin Pro 
105 

Val Glu Gin Pro Pro Ala 
120 

Phe Pro Arg Pro Glu Trp 
135 



Gin Glu Gin Gly Val 
15 

Gly Ser Tyr Val Thr 
30 

Gly Arg Leu Glu Arg 
45 

Leu Pro Pro Ser Gly 
60 

Phe Gly Arg Arg Gin 
80 

Gly Val Cys Ala Cys 
95 

Ala Thr Ser Ala Gly 
110 

His Pro Leu Gin Glu 
125 



Val Pro Pro Asn Gly 
140 



<210> 481 
<211> 167 
<212> PRT 
<213> Homo sapiens 

<400> 481 

Met His Gly Pro Gin Val Leu Ala Arg Cys Ser Glu Cys Ala Cys Pro 
5 10 15 

Ala Leu Ala Ala Thr Ser Ala Gly Val Arg Leu Glu Gly Val Asp Arg 
20 25 30 
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Pro Fro Thr Leu Pro Ser Gin Gly Ser Gly Trp Pro Cys Ser His Ser 
35 40 45 

Leu Ser Gly Cys His Leu Met Ala Asp Gly Ala Lys Ala Leu Gly Lys 
50 55 60 

Ala Asp Gly Pro Trp Pro Tyr Leu Phe Val Arg Arg Thr Asp Val Pro 
65 ' '70 75 80 

Cys Pro Ala Ala Ser Glu Val Gly Gly Cys Ala Pro Ser Ser Trp Arg 
85 90 95 

Ala Leu Ala Glu Val Thr Gly Cys Ser Leu Gly Pro Leu Gly Leu Ala 
100 105 110 

Gin His Ala Gin Ala Ser Val Leu Leu Leu Cys Tyr Lys Trp Ser His 
115 120 125 

lie Gly Glu Thr Ser Ser His Leu Arg Ser Lys Val Tyr Ala Ala Phe 
130 135 140 

Gly Gly Ser Ser Pro Cys Leu Lys Gly Leu Met Ser Leu Trp Ala Ser 
14 5 150 155 160 

Trp Leu Ser Arg Gly Arg Pro 
165 



<210> 482 
<211> 143 
<212> PRT 

<213> Homo sapiens 
<400> 482 

Met Glu Pro Tyr Arg Gly Asn Lys Lys Gin Val Gin Glu Lys Gly Val 
5 10 15 

Pro Cys Leu Trp Gly Ser Ser Pro Cys Leu Arg Cys His Met Ala Leu 
20 25 30 

Arg Ala Ser Trp Leu Pro Gly Gly Gly Pro Gin Ala lie Leu Gly Arg 
35 40 45 

Thr Leu Cys Ser Ser Ala Glu Ser Ser Gin Asp Cys His Pro Gly Gly 
50 55 60 

Pro Ser lie Ala Leu Ala Lys Pro Cys Arg Gly Val Trp Leu Leu Phe 
65 70 75 80 

Glu Pro Ala Trp Pro Pro Trp His Ala Arg Ala Pro Gly Ala Gly Thr 
85 90 95 

Leu Leu Arg Val Cys Leu Ser Cys Leu Gly Cys His Leu Cys Gly Gly 
100 105 110 



Ala Ser Gly Gly Gly Gly Pro Ala Thr Asn Leu Thr Gin Ser Arg Lys 
115 120 125 
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Trp Met Ala Met Phe Pro Gin Pro Glu Trp Leu Pro Pro Asp Gly 
130 135 140 

<210> 483 
<211> 143 

<212> PRT 

<213> Homo sapiens 

<400> 483 

Met Glu Thr Gin Arg Gly Asn Lys Gin Arg Ala Gin Glu Gin Gly Val 
5 10 15 

Cys Cys Leu Trp Gly Ser Ser Pro Cys Leu Gly Ser Tyr Gly Thr Ala 
20 25 30 

Gly Phe Leu Val Ala Lys Arg Arg Thr Thr Gly Leu Leu Glu Glu Asp 
35 40 45 

Phe Thr Phe Lys Cys Arg Lys Gin Pro Lys Leu Pro Ser Met Arg Leu 
50 55 60 

Ser Leu Leu Trp Pro Trp Arg Asp Leu Lys Phe Val Pro Arg Gin Asp 
65 70 75 80 

Lys Leu Thr Arg Ser Ser Val Ser Val Ala Gly Ala Tyr Ala Cys Arg 
85 90 95 

Ala Gly Pro Gly Trp Leu Lys Glu Gin Pro Ala Thr Ser Ala Arg Val 
100 105 110 

Arg Leu Val Gin Ala Glu His Pro Pro Pro His Pro Leu Glu Glu Val 
115 120 125 

Gly Met Ala Arg Phe Pro Gin Pro Glu Cys Leu Pro Pro Tyr Cys 
130 135 140 

<210> 484 

<211> 30 

<212> PRT 

<213> Homo Sapien 

<400> 484 

Thr Ala Ala Ser Asp Asn Phe Gin Leu Ser Gin Gly Gly Gin Gly Phe 

1 5 10 15 

Ala lie Pro lie Gly Gin Ala Met Ala lie Ala Gly Gin lie 
20 25 30 

<210> 485 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Made in a lab 

<400> 485 

gggaagctta tcacctatgt gccgcctctg c 31 
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<210> 486 

<.?!!> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Made in a lab 

<400> 486 

qcpriattctc acgctgagta tttggcc 27 

<210> 487 
<211> 36 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Made in a lab 
<400> 487 

cccgaattct tagctgccca tccgaacgcc ttcatc 36 

<210> 488 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Made in a lab 
<400> 488 

^(.^ga<igcttc ttccccggct gcaccagctg tgc 33 

<210> 489 
<211> 19 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Made in a lab 
<400> 489 

Met Asp Arg Leu Val Gin Arg Phe Gly Thr Arg Ala Val Tyr Leu Ala 

i 5 10 15 

S&x Val Ala 



<210> 490 
<211> 20 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Made in a lab 



<400> 490 

Tyi Leu Ala Ser Val Ala Ala Phe Pro Val Ala Ala Gly Ala Thr Cys 
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15 10 15 

Leu Ser His Ser 
20 

<210> 491 
<211> 20 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Made in a lab 
<400> 491 

Thr Cys Leu Ser His Ser Val Ala Val Val Thr Ala Ser Ala Ala Leu 

15 10 15 

Thr Gly Phe Thr 
20 

<210> 492 
<21I> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Made in a lab 
<400> 492 

Ala Leu Thr Gly Phe Thr Phe Ser Ala Leu Gin He Leu Pro Tyr Thr 

15 10 15 

Leu Ala Ser Leu 
20 

<210> 4 93 
<211> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Made in a lab 
<400> 493 

Tyr Thr Leu Ala Ser Leu Tyr His Arg Glu Lys Gin Val Phe Leu Pro 

15 10 15 

Lys Tyr Arg Gly 

20 

<210> 494 
<211> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Made in a lab 
<400> 494 

Leu Pro Lys Tyr Arg Gly Asp Thr Gly Gly Ala Ser Ser Glu Asp Ser 

1 5 10 15 

Leu Met He Ser 
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20 

<210> 495 
<211> 20 
<212> ?RT 

<213> Artificial Sequence 
<220> 

<223> Made in a lab 



<400> 495 

Asp Ser Leu Met Thr Ser Phe Leu Pro Gly Pro Lys Pro Gly Ala Pro 

15 10 15 

Phe Pro Asn Gly 
20 

<210> 496 
<211> 21 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Made in a lab 
<400> 496 

Ala Pro Phe Pro Asn Gly His Val Gly Ala Gly Gly Ser Gly Leu Leu 

15 10 15 

Pro Pro Pro Pro Ala 
20 

<210> 497 
<211> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Made in a lab 
<400> 497 

Leu Leu Pro Pro Pro Pro Ala Leu Cys Gly Ala Ser Ala Cys Asp Val 

15 10 15 

Ser Val Arg Val 
20 

<210> 498 
<211> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Made in a lab 
<400> 498 

Asp Val Ser Val Arg Val Val Val Gly Glu Pro Thr Glu Ala Arg Val 

1 5 10 15 

Val Pro Gly Arg 
20 
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<210> 499 
<211> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Made in a lab 
<400> 499 

Arg Val Val Pro Gly Arg Gly He Cys Leu Asp Leu Ala lie Leu Asp 

15 10 15 

Ser Ala Phe Leu 
20 

<210> 500 
<211> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Made in a lab 
<400> 500 

Leu Asp Ser Ala Phe Leu Leu Ser Gin Val Ala Pro Ser Leu Phe Met 

15 10 15 

Gly Ser lie Val 
20 

<210> 501 
<211> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Made in a lab 
<400> 501 

Phe Met Gly Ser He Val Gin Leu Ser Gin Ser Val Thr Ala Tyr Met 

15 10 15 

Val Ser Ala Ala 
20 

<210> 502 
<211> 414 
<212> DNA 
<213> Homo Sapien 

<220> 

<221> itiisc^f eature 
<222> {1),..(414) 
<223> n=A, T,C or G 

<400> 502 

caccatggag acaggcctgc gctggctttt cctggtcgct gtgctcaaag gtgtccaatg 60 

tcagtcggtg gaggagtccg ggggtcgcct ggtcacgcct gggacacctt tgacantcac 120 

ctgtagagtt tttggaatng acctcagtag caatgcaatg agctgggtcc gccaggctcc 180 

agggaagggg ctggaatgga tcggagccat tgataattgt ccacantacg cgacctgggc 24 0 
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'J.' .-iq ,j-"'cga ttnatnattt ccaaaacctn gaccacggtg gatttgaaaa tgaccagtcc 300 
Macadccgag gacacggcca cctarttrtq tggcagaatg aatactggta atagtggttg 360 
a -i : '1-^ at t tqaqqcccaq gcacccrggt caccgtmicc tcagggcaac ctaa 414 

<210> 503 
<211> 379 
<212> DNA 
< 2 1 3 > HoriO S api en 

<220> 

<221> mis cofeature 
''^222> ( 1 ) ... (379) 
<223> n=A,T,C or G 



<400> 503 

atncgatggt o-^t^gatcaa aggtgtccag tgtcagtcgg tggaggagtc cgggggtcgc 60 

ctggtcacgc ctT3;;acacc cctgacactc acctgcaccg tntctggatt ngacatcagt 120 

agctatgqag tqaqj^^gqgt ccgccaggct ccagggaagg ggctggnata catcggatca 180 

ttagtagtaq Lqatacattt tacgcgagct gggcgaaagg ccgattcacc atttccaaaa 240 

izctngaccdc ggt.gv;atttg aaaatcacca gtttgacaac cgaggacacg gccacctatt 300 

tntgtgccag ag-iqqgattt aattataaag acatttgggg cccaggcacc ctggtcaccg 360 

tntccttaqg ij^j^cctaa 379 



<2I0> ':0A 
<211> 19 
<212> FRT 

<213> Artificial Sequence 
<220> 

<223> Made in a lab 



<400> 50A 

vJiy Phe Thr Asn Tyr Thr Asp Phe Glu Asp Ser Pro Tyr Phe Lys Glu 

15 10 15 

Asn Ser Ala 



<210> 505 
<211> 20 
<212> PRT 

<213> Artificial Sequence 

<220> 

<22 3> Made m a lab 
<400> 505 

Lys Glu Asn Ser Ala Phe Pro Pro Phe Cys Cys Asn Asp Asn Val Thr 

i 5 10 15 

Asn Thr Ala Asn 

20 

<210> 5C6 
<211> 407 
<212> DNA 
<213> Homo Sapien 



<400> 506 



w o 01/51633 



PCT/llS<n/(n574 



atggagacag 
tcgctggagg 
accgtctctg 
aaggggctgg 
gtgaaaggcc 
ctgacaaccg 
ttgtggggcc 



gcctgcgctg 
agtccggggg 
gattctccct 
aatacatcgg 
gattcaccat 
aggacacggc 
caggcaccct 



gcttctcctg 
tcgcctggtc 
cagtagcaat 
atacattagt 
ctccaaaacc 
cacctatttc 
ggtcaccgtc 



175 



gtcgctgcgc 
acgcctggga 
gcaatgatct 
tatggtggta 
tcgaccacgg 
tgtgccagaa 
tcctcagggc 



tcaaaggtgt 
cacccctgac 
gggtccgcca 
gcgcatacta 
tggatctgag 
atagtgattt 
aacctaa 



ccagtgtcag 
actcacctgc 
ggctccaggg 
cgcgagctgg 
aatgaccagt 
tagtggtatg 



<210> 507 
<211> 422 
<212> DNA 
<213> Homo Sapien 



<400 
atggagacag 
tcggtggagg 
acagtctctg 
aaggggctgg 
gcaaaaggcc 
ccgacaaccg 
ggtccgtgct 
aa 



> 507 
gcctgcgctg 
agtccggggg 
gattctccct 
aatggatcgg 
ggttcaccat 
aggacacggc 
tgcgcatctg 



gcttctcctg 
tcgcctggtc 
cagcaactac 
gatcattaat 
ctccaaaacc 
cacctatttc 
gggcccaggc 



gtcgctgtgc 
acgcctggga 
gacctgaact 
tatgttggta 
tcgaccaccg 
tgtgccagag 
accctggtca 



tcaaaggtgt 
cacccctgac 
gggtccgcca 
ggacggacta 
tggatctcaa 
ggtggaagtg 
ccgtctcctt 



ccagtgtcag 
actcacctgt 
ggctccaggg 
cgcgaactgg 
gatcgccagt 
cgatgagtct 
^gggcaacct 



<210> 508 
<211> 411 
<212> DNA 
<213> Homo Sapien 

<220> 

<221> inisc_f eature 
<222> {!)... (411) 
<223> n-A,T,C or G 



<400 
atggagacag 
cggtggagga 
cagtctctgg 
aggggctgga 
cgaaaggccg 
cgacaaccga 
ctggttatta 



> 508 
gcctcgctgg 
gtccgggggt 
aatcgacctc 
atggatcgga 
attcaccatc 
ggacacggcc 
taaaatctgg 



cttctcctgg 
cgcctggtca 
agtagctact 
atcattggta 
tccaaaacct 
acctatttct 
ggcccaggca 



tcgctgtgct 
cgcctgggac 
gcatgagctg 
ctcctggtga 
cgaccacggt 
gtgccagaga 
ccctggtcac 



caaaggtgtc 
acccctgaca 
ggtccgccag 
cacatactac 
gcatntgaaa 
tcttcgggat 
cgtctccttg 



<210> 509 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Made in a lab 



cagtgtcagt 
ctcacctgca 
gctccaggga 
gcgaggtggg 
atcnccagtc 
ggtagtagta 

g 



60 
120 
180 
240 
300 
360 
407 



60 
120 
180 
240 
300 
360 
420 
422 



60 
120 
180 
240 
300 
360 
411 



<400> 509 

Leu Cys Lys Phe Thr Glu Trp lie Glu Lys Thr Val Gin Ala Ser 
^5 10 15 

<210> 510 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Made in a .lab 
<400> 510 

r*XL. Glu Tyr Asn Arg Pro Lou Leu Ala Asn Asp Leu Met Leu He 
1 5 10 15 

<210> 511 
<211> 15 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Made in a lab 
<400> 511 

Tyr His Pro Ser Met Phe Cys Ala Gly Gly Gly Gin Asp Gin Lys 

-! 5 10 15 

<210> 512 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Made in a lab 
<400> 512 

Asp Ser Gly Gly Pro Leu He Cys Asn Gly Tyr Leu Gin Gly Leu 
^5 10 15 

<210> 513 

<211> 15 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Made in a lab 

<400> 513 

Aj.- Pro Cys Gly Gin Val Gly Val Pro Asx Val Tyr Thr Asn Leu 
1 5 10 15 

<210> 514 

<211> 15 

<2l2-> PRT 

<213> Artificial Sequence 

<220> 

<223> Made in a lab 

<400> 514 

i.eu Cys Lys Phe Thr Glu Trp He Glu Lys Thr Val Gin Ala Ser 
1 ^ 5 lb 15 

<210> 515 
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<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Made in a lab 

<400> 515 

Met Val Glu Ala Ser Leu Ser Val Arg His Pro Glu Tyr Asn Arg 
^5 10 15 

<210> 516 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<22C> 

<223> Made in a lab 
<400> 516 

Val Ser Glu Ser Asp Thr lie Arg Ser He Ser He Ala Ser Gin 
1 ^ 10 15 

<210> 517 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Made in a lab 
<400> 517 

Glu Val Cys Ser Lys Leu Tyr Asp Pro Leu Tyr His Pro Ser Met 
15 10 15 

<210> 518 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Made in a lab 
<400> 518 

Arg Ala Glu Pro Gly Thr Glu Ala Arg Arg His Tyr Asp Glu Gly 
1 5 10 15 

<210> 519 
<211> 17 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Made in a lab 
<400> 519 

Arg Ala Glu Pro Gly Thr Glu Ala Arg Arg Asn Tyr Asp Glu Gly Cys 
1 5 10 15 
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Gly 



<210> 520 
<211> 25 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Made in a iab 
<400> 520 

Val Gly Glu Gly Leu Tyr Gin Gly Val Pro Arg Ala Glu Pro Gly Thr 

1 5 10 15 

Glu Ala Arg Arg His Tyr Asp Glu Gly 
20"^ 25 

<210> 521 
<211> 21 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Made in a lab 
<400> 521 

Ala Pro Phe Pro Asn Gly His Val Gly Ala Gly Gly Ser Gly Leu Leu 

15 10 15 

Pro Pro Pro Pro Ala 
20 

<210> 522 
<211> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Made in a lab 
<400> 522 

Leu Leu Val Val Pro Ala lie Lys Lys Asp Tyr Gly Ser Gin Glu Asp 

15 10 15 

Phe Thr Gin Val 
20 

<210> 523 
<211> 254 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Made in a lab 
<220> 

<221> VARIANT 

<222> (1) . - . (254) 

<223> Xaa = any amino acid 
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<400> 523 

Mot Ala Thr Ala Gly Asn Pro Trp Gly Trp Phe Leu Gly Tyr Leu He 

^5 10 15 

Leu Gly Val Ala Gly Ser Leu Val Ser Gly Ser Cys Ser Gin He He 

20 25 30 

Asn Gly Glu Asp Cys Ser Pro His Ser Gin Pro Trp Gin Ala Ala Leu 

35 40 45 

Val Met Glu Asn Glu Leu Phe Cys Ser Gly Val Leu Val His Pro Gin 

50 55 60 

Trp Val Leu Ser Ala Thr His Cys Phe Gin Asn Ser Tyr Thr He Gly 
^5 70 75 80 

Leu Gly Leu His Ser Leu Glu Ala Asp Gin Glu Pro Gly Ser Gin Met 

85 90 95 

Val Glu Ala Ser Leu Ser Val Arg His Pro Glu Tyr Asn Arg Pro Leu 

100 105 110 

Leu Ala Asn Asp Leu Met Leu He Lys Leu Asp Glu Ser Val Ser Glu 

120 125 
Ser Asp Thr He Arg Ser He Ser He Ala Ser Gin Cys Pro Thr Ala 

130 135 140 

Gly Asn Ser Cys Leu Val Ser Gly Trp Gly Leu Leu Ala Asn Gly Arg 

150 155 160 

Met Pro Thr Val Leu Gin Cys Val Asn Val Ser Val Val Ser Glu Glu 

1^5 170 175 

Val Cys Ser Lys Leu Tyr Asp Pro Leu Tyr His Pro Ser Met Phe Cys 

180 185 190 

Ala Gly Gly Gly Gin Xaa Gin Xaa Asp Ser Cys Asn Gly Asp Ser Gly 

200 205 
Gly Pro Leu He Cys Asn Gly Tyr Leu Gin Gly Leu Val Ser Phe Glv 

210 215 220 

Lys Ala Pro Cys Gly Gin Val Gly Val Pro Gly Val Tyr Thr Asn Leu 
225 230 235 240 

Cys Lys Phe Thr Glu Trp He Glu Lys Thr Val Gin Ala Ser 
245 250 

<210> 524 
<211> 765 
<212> DNA 
<213> Homo sapien 



<400> 524 

atggccacag 

ggatcgctcg 

tcgcagccct 

gtgcatccgc 

ctgggcctgc 

ctctccgtac 

aagttggacg 

tgccctaccg 

atgcctaccg 

ctctatgacc 

gactcctgca 

gtgtctttcg 

tgcaaattca 



caggaaatcc 
tctctggtag 
ggcaggcggc 
agtgggtgct 
acagtcttga 
ggcacccaga 
aatccgtgtc 
cggggaactc 
tgctgcagtg 
cgctgtacca 
acggtgactc 
gaaaagcccc 
ctgagtggat 



ctggggctgg 
ctgcagccaa 
actggtcatg 
gtcagccgca 
ggccgaccaa 
gtacaacaga 
cgagtctgac 
ttgcctcgtt 
cgtgaacgtg 
ccccagcatg 
tggggggccc 
gtgtggccaa 
agagaaaacc 



ttcctggggt 
atcataaacg 
gaaaacgaat 
cactgtttcc 
gagccaggga 
cccttgctcg 
accatccgga 
tctggctggg 
tcggtggtgt 
ttctgcgccg 
ctgatctgca 
gttggcgtgc 
gtccaggcca 



acctcatcct 
gcgaggactg 
tgttctgctc 
agaactccta 
gccagatggt 
ctaacgacct 
gcatcagcat 
gtctgctggc 
ctgaggaggt 
gcggagggca 
acgggtactt 
caggtgtcta 
gttaa 



tggtgtcgca 
cagcccgcac 
gggcgtcctg 
caccatcggg 
ggaggccagc 
catgctcatc 
tgcttcgcag 
gaacggcaga 
ctgcagtaag 
agaccagaag 
gcagggcctt 
caccaacctc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
765 



<210> 525 
<211> 254 
<212> PRT 
<213> Homo sapien 
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<4O0> 525 





Ala 


Thr 


Ala 


Gly 


Asn 


Pro 


Trp 


Gly 


Trp 


Phe 


Leu 


Gly 


Tyr 


Leu 


He 


1 








5 










10 










15 




I,e"j 


Gly 


Val 


Ala 


G^y 


Ser 


Leu 


Val 


Ser 


Gly 


Ser 


Cys 


Ser 


Gin 


He 


He 






20 










25 










30 






Asn 


Gly 


Glu 

35 


Asp 


Cys 


Ser 


Pro 


His 
40 


Ser 


Gin 


Pro 


Trp 


Gin 
45 


Ala 


Ala 


Leu 


Va.' 


Met 
50 


Glu 


Asn 


Glu 


Leu 


Phe 
55 


Cys 


Ser 


Gly 


Val 


Leu 
60 


Val 


His 


Pro 


Gin 


■ 


Va: 


Leu 


Ser 


Ala 


Ala 


His 


Cys 


Phe 


Gin 


Asn 


Ser 


Tyr 


Thr 


He 


Gly 












70 










75 










80 


l.eu 


Gly 


Leu 


His 


Ser 
85 


Leu 


Glu 


Ala 


Asp 


Gin 

90 


Glu 


Pro 


Gly 


Ser 


Gin 

95 


Met 


Vai 


Glu 


Ala 


Ser 
100 


Leu 


Ser 


Val 


Arg 


His 
105 


Pro 


Glu 


Tyr 


Asn 


Arg 
110 


Pro 


Leu 


Leu 


Ala 


Asn 
115 


Asp 


Leu 


Met 


Leu 


He 
120 


Lys 


Leu 


Asp 


Glu 


Ser 
125 


Val 


Ser 


Glu 


Ser 


Asp 
130 


Thr 


He 


Arg 


Ser 


He 
135 


Ser 


He 


Ala 


Ser 


Gin 
140 


Cys 


Pro 


Thr 


Ala 


Gly 


Asn 


Ser 


Cys 


Leu 


Val 


Ser 


Gly 


Trp 


Gly 


Leu 


Leu 


Ala 


Asn 


Gly 


Arg 


145 










150 










155 










160 


Met 


Pro 


Thr 


Val 


Leu 
165 


Gin 


Cys 


Val 


Asn 


Val 
170 


Ser 


Val 


Val 


Ser 


Glu 
175 


Glu 


Val 


Cys 


Ser 


Lys 
180 


Leu 


Tyr 


Asp 


Pro 


Leu 
185 


Tyr 


His 


Pro 


Ser 


Met 
190 


Phe 


Cys 


Ala 


Gly 


Gly 
195 


Gly 


Gin 


Asp 


Gin 


Lys 
200 


Asp 


Ser 


Cys 


Asn 


Gly 
205 


Asp 


Ser 


Gly 


Gly 


Pro 
210 


Leu 


He 


Cys 


Asn 


Gly 
215 


Tyr 


Leu 


Gin 


Gly 


Leu 
220 


Val 


Ser 


Phe 


Gly 


Lys 


Ala 


Pro 


Cys 


Gly 


Gin 


Val 


Gly 


Val 


Pro 


Gly 


Val 


Tyr 


Thr 


A.sn 


Leu 


225 










230 










235 










240 


Cys 


Lys 


Phe 


Thr 


Glu 


Trp 


He 


Glu 


Lys 


Thr 


Val 


Gin 


Ala 


Ser 







245 250 



<210> 526 
■.211> 963 
:212> DNA 

':213> Homo sapiens 



<400> 526 
atqagttcct 
aadgcccatt 
aactgcatcg 
L ttctctgca 
cttgcccttt 
ttctttattc 
cgttatgtgg 
gcccagattg 
ctgatcaagc 
caggatgtaa 
gccatt ctgc 
i t (J ■'^ -ii riacgg 
g tgt cacaca 
caccgctttg 
qc'tgcctc 



gcaacttcac 
tctgggttgg 
tggtcttcat 
tgcttgcagc 
tctggtttga 
atgccctct c 
ccatctgcca 
gcatcgtggc 
ggctggcctt 
tgaagttggc 
tggtcatggg 
tt ctgcaact 
ttggtgtggt 
gaaacagcct 
ct gtcatcaa 



acatgccacc 
cttccccctc 
cgtaaggacg 
cattgacctg 
ttcccgagag 
agccattgaa 
cccactgcgc 
tgtggtccgc 
ctgccactcc 
ctatgcagac 
cgtggacgta 
gccttccaag 
act cgcctt c 
tcatcccatt 
t cccatcatc 
gat cagct gt 



tttgtgctta 
cttt ccatgt 
gaacgcagcc 
gccttatcca 
attagctttg 
tccaccatcc 
catgctgcag 
ggat ccctct 
aatgtcctct 
actttgccca 
atgttcatct 
tcagagcggg 
tatgtgccac 
gtgcgtgttg 
tatggtgcca 
gacaaggact 



ttggtatccc 
atgtagtggc 
tgcacgctcc 
catccaccat 
aggcctgtct 
tgctggccat 
tgctcaacaa 
tttttttccc 
cgcactccta 
atgtggtata 
ccttgtccta 
ccaaggcctt 
ttattggcct 
tcatgggtga 
aaaccaaaca 
t gcaggct gt 



aggattagag 60 
aatgtttgga 120 
gatgtacctc 180 
gcctaagatc 240 
tacccagatg 300 
ggcctttgac 360 
tacagtaaca 420 
actgcctctg 480 
ttgtgtccac 540 
tggtcttact 600 
ttttctgata 660 
tggaacctgt 720 
ctcagttgta 780 
catctacctg 840 
gatcagaaca 900 
gggaggcaag 960 
963 
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<210> 527 
<211> 320 
<212> PRT 
<213> Homo sapiens 

<400> 527 

Met Ser Ser Cys Asn Phe Thr His Ala Thr Phe Val Leu He Gly He 
5 10 15 

Pro Gly Leu Glu Lys Ala His Phe Trp Val Gly Phe Pro Leu Leu Ser 
20 25 30 

Met Tyr Val Val Ala Met Phe Gly Asn Cys He Val Val Phe He Val 
35 40 4S 

Arg Thr Glu Arg Ser Leu His Ala Pro Met Tyr Leu Phe Leu Cys Met 
50 55 60 

Leu Ala Ala He Asp Leu Ala Leu Ser Thr Ser Thr Met Pro Lys He 
^5 70 75 80 

Leu Ala Leu Phe Trp Phe Asp Ser Axg Glu He Ser Phe Glu Ala Cys 
85 90 95 

Leu Thr Gin Met Phe Phe He His Ala Leu Ser Ala He Glu Ser Thr 
100 105 110 

He Leu Leu Ala Met Ala Phe Asp Arg Tyr Val Ala He Cys His Pro 
115 120 125 

Leu Arg His Ala Ala Val Leu Asn Asn Thr Val Thr Ala Gin He Gly 
130 135 140 

He Val Ala Val Val Arg Gly Ser Leu Phe Phe Phe Pro Leu Pro Leu 

150 155 160 

Leu He Lys Arg Leu Ala Phe Cys His Ser Asn Val Leu Ser His Ser 
165 170 175 

Tyr Cys Val His Gin Asp Val Met Lys Leu Ala Tyr Ala Asp Thr Leu 
180 185 190 

Pro Asn Val Val Tyr Gly Leu Thr Ala He Leu Leu Val Met Gly Val 
195 200 205 

Asp Val Met Phe He Ser Leu Ser Tyr Phe Leu He He Arg Thr Val 
210 215 220 

Leu Gin Leu Pro Ser Lys Ser Glu Arg Ala Lys Ala Phe Gly Thr Cys 

230 235 240 

Val Ser His He Gly Val Val Leu Ala Phe Tyr Val Pro Leu He Gly 
245 250 255 

Leu Ser Val Val His Arg Phe Gly Asn Ser Leu His Pro He Val Arg 
260 265 270 
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ViJ VEil Met Gly Asp He Tyr Leu Leu Leu Pro Pro Val He Asn Pro 
275 ^ * 280 285 

Tie He Tyr Gly Ala Lys Thr Lys Gin He Arg Thr Arg Val Leu Ala 

290 ^ 295 300 

^At,t ?tie Lys He Ser Cys A.sp Lys Asp Leu Gin A.la Val Gly Gly Lys 
305 310 315 320 

<210> 528 

<2H> 20 

<212> DNA 

<213> Homo Sapien 



<400> 528 
actatggtcc agaggctgtg 



20 



<210> 529 

<211> 20 

<212> DNA 

<213> Homo Sapien 



<400> 529 
atcacctatg tgccgcctct 



2 0 



<210> 530 

<211> 1852 

<212> DNA 

<213> Homo sapiens 



<400> 530 

qgcacgagaa 

a ;jaaccacct 

r ttcctctga 

tgtgtctgtt 

ttattgact t 

ctgcggcagc 

g-cggcgctga 

ggagttcttc 

tggatgaaga 

ttgggtaggt 

aaagtgtttg 

gctttctcca 

ccatcgtgca 

ggcaagagca 

agcaagaggt 

aacgtggtcg 

catggagaag 

gat ctcat eg 

get ctacat c 

cgat gtcaac 

tgccaggaag 

gatgagtatg 

aaagcact go 

ctgctacttg 

qcgaatttaa 

oga ccagcaa 

ctggaaagac 



ttaaaaccct 
atgacaagcc 
gaactgcaac 
gagat gctta 
gcctgtgt ta 
ttcgggataa 
tggctgagga 
cttcatagtt 
gtattacgtt 
tccaccatgt 
tttgtgaatg 
ccttgctgga 
tgcatctttc 
acgtgggcac 
gcaagtggtg 
cttggggaga 
at ctggacaa 
tcatgct cag 
tggcctctgc 
tt aatgtcct 
atgaatgtgc 
gaaataccac 
tcttatacgg 
gtatacatga 
atgcgctgga 
Qua i_agi_cag 
ggccagagag 



cagcaaaaca 
cacagccaac 
aataaataca 
tgtgactttg 
gaccggaaga 
cttgaggctg 
cagagcttca 
catccatatg 
gtgcagatat 
tgccgcagat 
gatattgtgg 
agtgacctgc 
atttcctgca 
ttctggagac 
ctgccactgc 
ctacgatgac 
gctccacaga 
ggacacggat 
caat gggaat 
tgacaacaaa 
gt taat gttg 
tctacactat 
tgctgatatc 
gcaaaaacag 
tagatatgga 
ccctctactt 
tatgctgttt 



ggcatagaag 
ataatactaa 
aggatgctgg 
cttttaattc 
gctggggtgt 
catcactggg 
gtgtggcttc 
gctccagagg 
actgcagtgt 
gacatgattt 
tttctggatc 
tgtccagaag 
tttcttcctc 
cacaacgact 
ttcccctgct 
agcgccttca 
gctgcctggt 
gtgaacaaga 
tcagaagtag 
aagaggacag 
ctggaacatg 
gctgtctaca 
gaatcaaaaa 
caagtggtga 
agaactgctc 
gagcaaaatg 
ctagtcatca 



ggacatacct 
atggggaaaa 
attttgtcaa 
tgtttatgtg 
tt ctcaggag 
gaagaaacac 
tctgcgactg 
aaaattatat 
cttcatctct 
cagtacctgt 
tcatcctctg 
tttgatggct 
cctggatgga 
cctctgtgaa 
gcagggggag 
tggatcccag 
ggggtaaagt 
gggacaagca 
taaaactcgt 
ctctgacaaa 
gcactgatcc 
atgaagataa 
acaagcatgg 
aatttttaat 
tcatacttgc 
ttgatgtatc 
tcatgtaatt 



taaagtaata 
gttagaagca 
atgccttttc 
attatcacat 
ccaccgtgtg 
aytcctgtcc 
gcttcttcgg 
tattttgtta 
tgatgtgtga 
gtctggctga 
tgggtggaca 
gaggagtata 
cagggggagc 
gacgcttggg 
cggcaagagc 
gtaccacgtc 
ccccagaaag 
aaagaggact 
gctggacaga 
ggccgtacaa 
aaatattcca 
attaatggcc 
cctcacacca 
caagaaaaaa 
tgtatgttgt 
ttctcaagat 
tgccagttac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 
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tttctgacta caaagaaaaa cagatgttaa aaatctcttc tgaaaacagc aatccagaac 1680 

aagacttaaa gctgacatca gaggaagagt cacaaaggct taaaggaagt gaaaacagcc 1740 

agccagagct agaagattta tggctattga agaagaatga agaacacgga agtactcatg 1800 

tgggattccc agaaaacctg actaacggtg ccgctgctgg caatggtgat ga 1852 

<210> 531 

<211> 879 

<212> DNA 

<213> Homo sapiens 

<400> 531 

atgcatcttt catttcctgc atttcttcct ccctggatgg acagggggag cggcaagagc 60 

aacgtgggca cttctggaga ccacaacgac tcctctgtga agacgcttgg gagcaagagg 120 

tgcaagtggt gctgccactg cttcccctgc tgcaggggga gcggcaagag caacgtggtc 180 

gcttggggag actacgatga cagcgccttc atggatccca ggtaccacgt ccatggagaa 240 

gatctggaca agctccacag agctgcctgg tggggtaaag tccccagaaa ggatctcatc 300 

gtcatgctca gggacacgga tgtgaacaag agggacaagc aaaagaggac tgctctacat 360 

ctggcctctg ccaatgggaa ttcagaagta gtaaaactcg tgctggacag acgatgtcaa 420 

cttaatgtcc ttgacaacaa aaagaggaca gctctgacaa aggccgtaca atgccaggaa 480 

gatgaatgtg cgttaatgtt gctggaacat ggcactgatc caaatattcc agatgagtat 540 

ggaaatacca ctctacacta tgctgtctac aatgaagata aattaatggc caaagcactg 600 

ctcttatacg gtgctgatat cgaatcaaaa aacaagcatg gcctcacacc actgctactt 660 

ggtatacatg agcaaaaaca gcaagtggtg aaatttttaa tcaagaaaaa agcgaattta 720 

aatgcgctgg atagatatgg aagaactgct ctcatacttg ctgtatgttg tggatcagca 780 

agtatagtca gccctctact tgagcaaaat gttgatgtat cttctcaaga tctggaaaga 840 

cggccagaga gtatgctgtt tctagtcatc atcatgtaa 879 

<210> 532 
<211> 292 
<212> PRT 
<213> Homo sapiens 

-^4 G 532 

Met His Leu Ser Phe Pro Ala Phe Leu Pro Pro Trp Met Asp Arg Gly 
5 10 15 

Ser Gly Lys Ser Asn Val Gly Thr Ser Gly Asp His Asn Asp Ser Ser 
20 25 30 

Val Lys Thr Leu Gly Ser Lys Arg Cys Lys Trp Cys Cys His Cys Phe 
35 40 45 

Pro Cys Cys Arg Gly Ser Gly Lys Ser Asn Val Val Ala Trp Gly Asp 
50 55 60 

Tyr Asp Asp Ser Ala Phe Met Asp Pro Arg Tyr His Val His Gly Glu 
65 70 75 80 

Asp Leu Asp Lys Leu His Arg Ala Ala Trp Trp Gly Lys Val Pro Arg 
85 90 95 

Lys Asp Leu lie Val Met Leu Arg Asp Thr Asp Val Asn Lys Arg Asp 
100 105 110 

Lys Gin Lys Arg Thr Ala Leu His Leu Ala Ser Ala Asn Gly Asn Ser 
115 120 125 

Glu Val Val Lys Leu Val Leu Asp Arg Arg Cys Gin Leu Asn Val Leu 
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J 4 ' 



:i 30 

/^sr: Lys uys Arc 



135 

Thr Ala Leu 
150 



140 

:hr Lys Ala Val Gin Cys Gin Glu 



155 



160 



hsp Glu Cys Ala Leu 
165 



Met Leu Leu Glu His Gly Thr Asp Pro Asn lie 
170 175 



Pro Asp Glu Tyr Gly 
180 



Asn Thr Thr Leu His Tyr Ala Val Tyr Asn Glu 
185 190 



Asf- Lys Leu Met Ala 
195 



Lys Ala Leu Leu Leu Tyr Gly Ala Asp lie Glu 
200 205 



Ser Lys Asn Lys His 

210 



Gly Leu Thr Pro Leu Leu Leu Gly lie His Glu 
215 220 



Gxii 

225 



Lys Gin Gin Val 



Val Lys Phe Leu lie Lys Lys Lys Ala Asn Leu 
230 235 240 



Asn Ala Leu Asp Arg 
245 



Tyr Gly Arg Thr Ala Leu lie Leu Ala Val Cys 

250 255 



^yy: Gly Ser Ala Ser 
260 



lie Val Ser Pro Leu Leu Glu Gin Asn Val Asp 
265 270 



Val Ser Ser Gin Asp 
275 



Leu Glu Arg Arg Pro Glu Ser Met Leu Phe Leu 
280 285 



Val 



lie He Met 

290 



'^210> 533 
<211> 801 
<212> DNA 

<213> Homo sapiens 



<400> 533 
atgtacaagc 
gcaggctcag 
tatgccactg 
tttatgttcc 
ccctggccag 
gtacctcaaa 
tctgcagggg 
ggatggccat 
ttaggaaaag 
cctgcagcga 
actggttgct 
ctgtgctaca 
gctgcctttg 
tgccaagag 

-:210> 534 
<211> 266 
212.> PRT 
<213> Homo sapiens 



ttcagtgcaa 
gagcagggta 
cacgattctt 
actgcagaaa 
acat agaatt 
tctgtgcgtg 
tgcgtctgga 
gtt cccacag 
cagatggccc 
gtgaggttgg 
ctttgagccc 
aatggagcca 
ggggctccag 
gcagaccata 



caactgtgct 
tgcgctgcct 
ggttgccaag 
gcagccagga 
tgt gccaagg 
ccagacaagg 
ggaggtggac 
cctgagtggc 
ttggccctac 
tggctgtgcc 
tcttagcctt 
tataggggaa 
tccttgcctc 

g 



acaaatggag 
tcggctctcc 
aggccaacca 
tcaccatcca 
caggacaagc 
ccaaactggc 
cagccaccaa 
tgccacctga 
ctttttgtta 
cccagctcct 
gcccagcatg 
acgagcagcc 
aagggtctta 



ccacagagag 
aatccatgcc 
caggccatct 
ggggacttgg 
tcactcagag 
tcaatgagca 
ccttacccag 
tggctgatat 
gaagaactga 
ggcacaccct 
cacaagcctc 
atctcaggag 
tgtcactgtg 



gaaacaagca 
tcagggctcc 
tgagaaggag 
tcttctgtgg 
cagcgtgtta 
accagccacc 
t caaggaagt 
agcaaaggcc 
tgttccatgt 
cgcagaggtg 
agtgctacta 
caaggtgtat 
ggcttcttgg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

801 
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<400> 534 

Met Tyr Lys Leu Gin Cys Asn Asn Cys Ala Thr Asn Gly Ala Thr Glu 
5 10 15 

Arg Lys Gin Ala Ala Gly Ser Gly Ala Gly Tyr Ala Leu Pro Ser Ala 
20 25 30 

Leu Gin Ser Met Pro Gin Gly Ser Tyr Ala Thr Ala Arg Phe Leu Val 
35 40 45 

Ala Lys Arg Pro Thr Thr Gly His Leu Glu Lys Glu Phe Met Phe His 
50 55 60 

Cys Arg Lys Gin Pro Gly Ser Pro Ser Arg Gly Leu Gly Leu Leu Trp 
65 70 75 80 

Pro Trp Pro Asp lie Glu Phe Val Pro Arg Gin Asp Lys Leu Thr Gin 
85 90 95 

Ser Ser Val Leu Val Pro Gin He Cys Ala Cys Gin Thr Arg Pro Asn 
100 105 110 

Trp Leu Asn Glu Gin Pro Ala Thr Ser Ala Gly Val Arg Leu Glu Glu 
115 120 125 

Val Asp Gin Pro Pro Thr Leu Pro Ser Gin Gly Ser Gly Trp Pro Cys 
130 135 140 

Ser His Ser Leu Ser Gly Cys His Leu Met Ala Asp He Ala Lys Ala 
145 150 155 160 

Leu Gly Lys Ala Asp Gly Pro Trp Pro Tyr Leu Fhe Val Arg Arg Thr 
165 170 175 

Asp Val Pro Cys Pro Ala Ala Ser Glu Val Gly Gly Cys Ala Pro Ser 
180 185 190 

Ser Trp His Thr Leu Ala Glu Val Thr Gly Cys Ser Leu Ser Pro Leu 
195 200 205 

Ser Leu Ala Gin His Ala Gin Ala Ser Val Leu Leu Leu Cys Tyr Lys 
210 215 220 

Trp Ser His He Gly Glu Thr Ser Ser His Leu Arg Ser Lys Val Tyr 
225 230 235 240 

Ala Ala Phe Gly Gly Ser Ser Pro Cys Leu Lys Gly Leu Met Ser Leu 
245 250 255 

Trp Ala Ser Trp Leu Pro Arg Gly Arg Pro 
260 265 



<210> 535 

<211> 6082 

<212> DNA 

<213> Homo sapiens 
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.;■ .:qaqc 
c q q a ccccqc 
g c d q d t g c t 
qq: cacacgt 
• q g q a a q a 
q q 1 ^ q c a a g g 
ct t taacaag 
q.nq. qattga 
o ' \ t * g a a a a 
cqgtgct gac 
ttcact gi gc 
utcgt ct t ag 
ccaatgatq*^ 
cactgcagg^: 
ctqqga tggc 
catcactgaq 
ttataact qg 
ttaccaat 1 1 
t gaatt tqgc 
dcgtgctcct 
oggctgtgcG 
caatcgt cag 
accgtcagct 
gggataaggc 
aattgtt age 
tcggggaatt 
ctcagcagcc 
acgaaaagga 
tgttggagga 
agaaagcacg 
acgatcctct 
gtcaaatttt 
ctgcaagtca 
agt tcctaaa 
aacaacctcc 
tttggtctca 
cagagaatgt 
aggcctataa 
t cctaaacac 
caaacaaaca 
tagatct taa 
gratagcaag 
acaaaatgtt 
gaagaatttt 
cgtttttaga 
Gcgtgattcc 
ggcgatattt 
cagtgttttc 
cagaagagag 
t c ^ tgttttt 
ttg tea teat 
ttggtttggc 
ac^agtgctga 
tt qaaaaaga 
gqgr gat a at 



tacugctcci'L: 
ggacagqcg u 
ggccact qqc 
gcccqt grac 
gttctr ct gg 
tgatarqtat 
gt tctgggat 
agcaat cata 
ggaaagtqcc 
t tatgatccc 
1 1 1 1 tqcacg 
t ggga t gagg 
^aacatggcc 
qaacaagtt t 
qatcgcaatg 
d^rtctaaic 
gjgtaaaact 
tatuaggata 
q a q a a a g a a g 
ttcgttt ttc 
cggv-<iqt gt g 
gctqacqgtt 
cat ccqaaga 
gccgt cagat 
a t Cdgagacc 
tgtggtcggc 
ggccccaagt 
ct gggtgtt c 
acgatatgaa 
tggtgat ct g 
ggtaaacct t 
cagtgcagt a 
gcatgagaag 
gattctgata 
atctggtata 
agttccagga 
acaatcttct 
cccagtt aca 
gaattactt c 
tgcagct cag 
aagtatgct a 
ctqgtactt a 
at ctctatt g 
tgagt caatt 
aaatcgt tt c 
tttcatccag 
ttggatcgca 
tttggaaacg 
ccacttgtca 
gt gtcaggaa 
gacaa eg tec 
cgttgcett -L 
act gt ect at 
agttgagaat 
agcacct tqg 
ctttgacaat 



aggaaattac 
ggcggccgga 
gcctgatcag 
caggaggtga 
tggct caatc 
t cagtgctgc 
aaagaagttt 
aagtgttact 
aaagtaatcc 
atggattctg 
ctcattttgg 
ttacgagtag 
atggggaaga 
gatcaggtga 
actgccctac 
attctcctgc 
gcaactttca 
a taaaaatgt 
gagatttcca 
agtgcaagca 
atcaeagcca 
accctcttct 
atccagacct 
ggt aaaaaga 
ccaact ctac 
cccgtgggag 
cacgggctgg 
t cgggaactc 
aaagtcataa 
actgtgatag 
gcaagagcag 
gatgcggaag 
atcacaattt 
ttgaaagatg 
gattttggct 
actcccacac 
agaccctcct 
ctatcagagg 
agagctggtg 
gttgcctatg 
aatgtcactg 
ggaatttatt 
gtattctacg 
etgaaagctc 
tccaaagaca 
acattgctac 
atacccttgg 
tcaagagatg 
tcttct ctcc 
ctgtttgatg 
egctggttcg 
gggt ccctga 
ceectcacgc 
atgatgatct 
qaatatcaga 
gtgaacttca 



aat ceacttt 
gccccagcat 
cgcgaccccg 
agcccaaccc 
ccttgttt aa 
cagaagaccg 
taagagctga 
ggaaatctta 
agcccatatt 
tggctttgaa 
ctatactgca 
ccatgtgcca 
caaccaeagg 
cagtgttctt 
tctggatgga 
ccttgcaaag 
cggatgccag 
acgcctggga 
agattctgag 
aaatcatcgt 
gccgcgtgtt 
tcGcctcagc 
ttttgctact 
tggtgcatgt 
aaggcctttc 
cagggaagtc 
tcagcgtgca 
tgaggagtaa 
aggcttgtgc 
gagatcgggg 
tgtatcaaga 
ttagcagaca 
tagtgactca 
gtaaaatggt 
cccttttaaa 
taaggaatcg 
tgaaagatgg 
agaaccgttc 
ctcactggat 
tgcttcaaga 
taaatggagg 
caggtttaac 
tccttgttaa 
cggtattatt 
ttggacactt 
aagtggttgg 
tt ccccttgg 
tgaagcgcct 
aggggctctg 
cacaccagga 
ccgtccgtct 
ttctggcaaa 
tcatggggat 
eagtagaaag 
aacgeccacc 
tgtacagtcc 



acaggcct ca 
ccctgcttga 
gcccgcgccc 
gctgcaggac 
aa trggccat 
ctcacagcac 
gaatgacgca 
trragttttg 
tttgggaaaa 
cacagcgtac 
tcacttatat 
tatgatttat 
ccagatagt c 
acacttcctg 
gataggaata 
ctgttttggg 
gatcaggacc 
aaagtcattt 
aagttcctgc 
gtttgtgacc 
cgtggcagtg 
cattgagagg 
tgatgagata 
gcaggatttt 
ctttactgtc 
atcactgtta 
tggaagaatt 
tattttattt 
tctgaaaaag 
aaccacgctg 
tgctgacatc 
cttgttcgaa 
tcagttgcag 
gcagaagggg 
gaaggataat 
taccttctca 
tgetctggag 
tgaaggaaaa 
tgtcttcatt 
ttggtggctt 
aggaaatgta 
tgtagctacc 
ctcttcacaa 
ctttgataga 
ggatgatttg 
tgtggtctct 
aat catt ttc 
ggaatctaca 
gaccat cegg 
tttacattca 
ggatgccatc 
aactctggat 
gtttcagtgg 
ggtcattgaa 
accagcctgg 
aggtgggcct 



aaggtt catt 
ggt ccaggag 
gccccgcccg 
gcgaacct ct 
aaacggagat 
cttggagagg 
cagaagcctt 
ggaattttta 
attattaatt 
gcct atgcca 
ttttatcacg 
cggaaggcac 
aatctgctgt 
tgggcaggac 
tcgtgccttg 
aagttgttct 
atgaatgaag 
tcaaatctta 
ctcaggggga 
ttcaccacct 
acgctgtatg 
gtgtcagagg 
tcacagcgca 
actgcttttt 
agacctggcg 
agtgccgtgc 
gcctatgtgt 
gggaagaaat 
gatttacagc 
agtggagggc 
tatctcctgg 
ctgtgtattt 
tacctcaaag 
acttacactg 
gaggaaagtg 
gagtcttcgg 
agccaagata 
gttggttttc 
ttccttattc 
tcatactggg 
accgagaagc 
gttctttttg 
actttgcaca 
aatccaatag 
ctgccgctga 
gtggctgtgg 
atttttcttc 
actcggagtc 
gcatacaaag 
gaggcttggt 
tgtgccatgt 
gccgggcagg 
tgtgtt cgac 
tacacagacc 
ccccatgaag 
ctggtactga 
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agcatctgac 
ctggaaaaag 
ggattgataa 
tcatacctca 
atgagcacac 
ttgaagatct 
ttggacaaag 
ttattgatga 
ccgggagaaa 
tgacagcgac 
tgttttgctg 
agaagccgct 
tattggtcac 
tattttcgag 
ccactagttt 
ttatacatac 
gctctaacaa 
cagaggtgca 
gagtcaaagc 
ttattgcccc 
gactgtgaag 
cccctgaagg 
aaaagtttgt 
tggatacatg 
agagaaaata 
caaatgtgac 
ttttagactc 
aatacgtggc 
tgtaagcctt 
ctaacgtgat 
tctcaggatg 



cactaatagc 

gtaatatgtt 
tgacctctca 
acgtgtgcat 
agcagagagc 
taactgccaa 
tatggttaat 
cgtattttag 
tatattcaaa 
gtcctgtttc 
taaatgatag 
ggcccgttta 
tttgcccctc 
aataaaatga 



agcactcatt 
ttccctcatc 
gatcttgaca 
ggaacctgtt 
ggatgaggaa 
tcctggtaaa 
acaactggtg 
agcgacggca 
tttgcccact 
aagataatgg 
caaaataaag 
gccctcactg 
actgaccaca 
acagcactgt 
ttggactatg 
aagatgctag 
aatggtttat 
ggccaccagt 
tcatttttaa 
caaaattaca 
ttgccatttt 
cttcttgtta 
gacattgtag 
gttaaaggat 
ctactttctc 
cgtccatcct 
agcattgtga 
attattcttt 
ttggtttggg 
gagacaaact 
catccagggg 
attgtcattt 
tgtactttat 
gagttcagtt 
gaataaggtg 
ttgtttgtat 
aacagatgtt 
aatacattga 
gttcagtcca 
gtgactgaaa 
atgtctatat 
tgttttaatg 
ttttcatctc 
aacccgctga 
cGccgtgcct 
ggaaattgca 



aaatcacaag 
tcagcccttt 
actgaaattg 
ttgttcactg 
ctgtggaatg 
atggatactg 
tgccttgcca 
aatgtggatc 
gcaccgtgct 
ttttagattc 
agagcctatt 
aaacagcaaa 
tggttacaaa 
gaatccaacc 
taaaccacat 
ttcatttgaa 
ttttatttaa 
taaatgccgt 
aggagtagga 
tgttaatttc 
gtctcattgt 
gaaaatagta 
tagggagtgt 
agaagggcaa 
aaaatggaag 
cctttagagt 
cacttcccaa 
tttggatttc 
ctgtattgaa 
gaaaattatt 
ttcattttca 
gggctttctg 
ttgaactatg 
tactaaaggt 
tcacctccct 
acatatattt 
ttattgagtg 
ccgtagtagc 
acaaaccaca 
ttgcaacagt 
gtttatttgg 
ttatcataga 
caaaaaaaaa 
tcagcctcga 
tccttgaccc 
tc 



aaaaggttgg 
ttagattgtc 
gacttcacga 
gaacaatgag 
ccttacaaga 
aattagcaga 
gggcaattct 
caagaactga 
aaccattgca 
aggaagactg 
ttacaagatg 
acaggtatac 
cacttccaat 
aaaatgtcaa 
tgtacttttt 
tatttctccc 
atgtcaatag 
ctatcaggtt 
cagagttgtc 
catttatatc 
tttctttgac 
cagttacaac 
gtacccctta 
tattttatca 
cccttaaagg 
tgcatgactt 
gaaggccaaa 
tcatttatgg 
atcctttcta 
gcaagcattg 
tgagcctgtc 
ttgaatgaat 
agtcttraat 
tttactatta 
gaaattgcat 
gtccttcgta 
aagccttaaa 
tgttcaactc 
cacagtaaat 
gatcataatg 
acttttgagg 
attttttaat 
aaaaaaaagg 
ctgtgccttc 
tggaaggtgc 



catt gtggga 
agaacccgaa 
tttaaggaag 
gaaaaacctg 
ggtacaactt 
atcaggatcc 
caggaaaaat 
tgagttaata 
cacagattga 
aaagaatatg 
gtgcaacaac 
ttcaaaagaa 
ggacagccct 
gtccgttccg 
tttactttgg 
aacttatcca 
ttgtttttta 
ttgtgcctta 
acaggttttt 
agggattcta 
ataactagga 
caataggaac 
ctccccatca 
tatgttctaa 
tgctttgata 
ggacacggta 
cctctaaccg 
aaggctaacc 
aattgcatga 
actataatta 
caggttagtt 
caacaaacca 
ttttcctgat 
tggtttgaag 
atatgtatat 
tagcaagttt 
aagcacacac 
ctagtactta 
gtttattaat 
aggtttgtta 
ttaaagacag 
gaaactaaat 
gcggccgctc 
tagttgccag 
cactcccact 



agaaccggag 
ggtaaaattt 
aaaatgtcaa 
gatcccttta 
aaagaaacca 
aattttagtg 
cagatattga 
caaaaaaaat 
acaccattat 
atgagccgta 
tgggcaaggc 
attatccaca 
cgaccttaac 
aaggcatttg 
caacaaatat 
aggatctcca 
aaatccaaat 
agagactaca 
gttgttgttt 
tttacttgaa 
tccattattt 
aacaaaaaga 
aaaaaaaaaa 
aagagaagga 
ctgaaggaca 
actgttgcag 
acattcctga 
ctctgttgac 
ataggctctg 
tgcagtacgt 
tactcctgac 
caatacttcc 
gatggtggct 
tggagtctca 
agacatgcac 
tttgctcatc 
cacacacagc 
gaaatacacg 
agtcatggtt 
aaatgatagc 
tcatataaac 
tcaattgaaa 
gagtctagag 
ccatctgttg 
gtcctttcct 
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coLjl gycgca 
ct caqccacc 
rirttaqgacc 
' ccat gtgca 
aatccctt gt 
ctgccagaag 
;.|t" ^ taagag 
tiactggaaat 
a' ccagccca 
: t gtggctt 
^ -Lggctatac 
gtagccatgt 
aagacaacca 
gtgacagtgt 
ctact ctgga 
ctgcccttgc 
rtcacggatg 
Btgtacgcct 
Lccaagattc 
agcaaaat ca 
gccagccgcg 
ttcttcccct 
ricc^t ttttgc 
aaqatggtgc 
ctacaaggcc 
ggagcaggga 
ctggtcagcg 
act ct gagga 
ataaaggctt 
ataggagatc 
gc!agtgtatc 
gaagtt agca 
dttt tagtga 
gatggtaaaa 
ggctcccttt 
acactaagga 
tcct tgaaag 
gaggagaacc 
ggtgctcact 
tatgtgcttc 
actgtaaatg 
tattcaggtt 
tacgtccttg 
qctccggtat 
gacattggac 
ctacaagtgg 
ttggttcccc 
gatgtgaagc 
ctccaggggc 
gatgcacacc 
tt cgccgt cc 
rt qattctgg 
:i ■:'gr-t catgg 
at ctcagtag 
'':agaaacgcc 
rt- cat gtaca 
C - g a a a a g g 
ctttttagat 



gtcr cagct c 
agact agcag 
cact tggat a 
aatacccttt 
tt aaaart gg 
accgctcaca 
ctgagaatga 
cttatttagt 
tatttttggg 
tgaacacagc 
tgca wCactt 
gccatatgat 
caggccagat 
tctt acact t 
t ggagat agg 
aaagctgttt 
ccaggatcag 
gggaaaagt c 
tgagaagttc 
tcgtgtttgt 
tgttcgtggc 
cagccattga 
tacttgatga 
atgtgcagga 
tttcctttac 
agtcatcact 
tgcatggaag 
gtaatatttt 
gtgctctgaa 
ggggaaccac 
aagatgct ga 
gacacttgtt 
ctcatcagtt 
tggtgcagaa 
taaagaagga 
atcgtacctt 
atggtgct ct 
gttctgaagg 
ggattgtctt 
aagattggtg 
gaggaggaaa 
taactgtagc 
ttaactcttc 
tattctttga 
acttggatga 
ttggtgtggt 
ttggaatcat 
gcctggaatc 
tctggaccat 
aggatttaca 
gtctggatgc 
caaaaactct 
ggatgtt t ca 
aaagggtcat 
caccaccagc 
gtccaggtgg 
ttggcattgt 
tgtcagaacc 



actgcagcct 
gt ct cccccg 
atccaggat g 
t cccaaataa 
ccat aaacqg 
gcaccttgga 
cgcacagaag 
tttgggaatt 
aaaaatt att 
gtacgcctat 
atatttttat 
ttat cggaag 
agtcaat ct g 
cctgtgggca 
aatat cgtgc 
tgggaagttg 
gaccatgaat 
atttt caaat 
ctgcct cagg 
gacctt cacc 
agtgacgctg 
gagggtgtca 
gatatcacag 
ttttactgct 
tgtcagacct 
gttaagtgcc 
aattgcctat 
atttgggaag 
aaaggattta 
gctgagtgga 
catct atct c 
cgaactgtgt 
gcagtacctc 
ggggacttac 
taat gaggaa 
ctcagagtct 
ggagagccaa 
aaaagttggt 
cattttcctt 
gctttcatac 
tgtaaccgag 
taccgttctt 
acaaactttg 
"cagaaatcca 
tttgctgccg 
ctctgtggct 
tttcattttt 
tacaact egg 
ccgggcat ac 
ttcagaggct 
catct gtgcc 
ggat gccggg 
gtggt gtgtt 
tgaa t acaca 
ct ggcGCcat 
gcctctggta 
gggaagaacc 
cgaaggtaaa 



ccacct cctg 
cctctttctt 
atgtcttcac 
cattcaattc 
agattagagg 
gaggagttgc 
ccttctttaa 
tttacgttaa 
aattattttg 
gccacggtgc 
cacgttcagt 
gcacttcgtc 
ctgtccaatg 
ggaccactgc 
cttgctggga 
tt ctcatcac 
gaagtt ataa 
cttattacca 
gggatgaatt 
acctacgtgc 
tatggggctg 
gaggcaatcg 
cgcaaccgtc 
ttttgggata 
ggcgaattgt 
gtgctcgggg 
gtgtctcagc 
aaatacgaaa 
cagctgttgg 
gggcagaaag 
ctggacgatc 
atttgt caaa 
aaagctgcaa 
actgagttcc 
agtgaacaac 
tcggtttggt 
gatacagaga 
tttcaggcct 
attctcctaa 
tgggcaaaca 
aagctagatc 
tttggcatag 
cacaacaaaa 
ataggaagaa 
ctgacgtttt 
gtggccgtga 
cttcggcgat 
agtccagtgt 
aaagcagaag 
tggttcttgt 
atgtttgtca 
caggttggtt 
cgacaaagtg 
gaccttgaaa 
gaaggagtga 
ctgaagcatc 
ggagct ggaa 
atttggattg 



tgttcaagca 
ggaaggacac 
t ccaacatcc 
t ttaccagga 
aagatgatat 
aagggt tctg 
caagagcaa t 
ttgaggaaag 
aaaattatga 
tgactttttg 
gtgctgggat 
ttagtaacat 
atgtgaacaa 
aggcgatcgc 
tggcagttct 
tgaggagtaa 
ctggtataag 
atttgagaaa 
tggcttcgtt 
tcctcggcag 
tgcggctgac 
tcagcatccg 
agctgccgtc 
aggcatcaga 
tagctgtggt 
aattggcccc 
agccctgggt 
aggaacgata 
aggatggtga 
cacgggtaaa 
ctctcagtgc 
ttttgcatga 
gtcagattct 
taaaatctgg 
ctccagttcc 
ctcaacaatc 
atgtcccagt 
ataagaatta 
acactgcagc 
aacaaagtat 
ttaactggta 
caagatctct 
tgtttgagtc 
ttttaaatcg 
tagatttcat 
ttccttggat 
attttttgga 
t ttcccactt 
agaggtgt ca 
ttttgacaac 
teat cgttgc 
tggcactgtc 
ctgaagttga 
aagaagcacc 
taatctttga 
tgacagcact 
aa agt t ccct 
ataagatctt 



gtcct cctgc 
ttgccattgg 
t cagt ttaat 
aaggt ggct c 
gtattcagtg 
ggat aaagaa 
cataaagtgt 
tgccaaagta 
tcccat ggat 
cacgctcatt 
gaggttacga 
ggccatgggg 
gtttgatcag 
agtgact gcc 
aatcattctc 
aactgcaact 
gat aat aaaa 
gaaggagatt 
ttt cagtgca 
tgtgat caca 
ggttaccctc 
aagaatccag 
agatggt aaa 
gaccccaact 
cggccccgtg 
aagtcacggg 
gttctcggga 
tgaaaaagtc 
tctgactgtg 
ccttgcaaga 
agtagatgcg 
gaagatcaca 
gatattgaaa 
tatagatttt 
aggaact ccc 
ttctagaccc 
tacactatca 
cttcagagct 
tcaggttgcc 
gctaaatgtc 
cttaggaatt 
attggtattc 
aattctgaaa 
tttctccaaa 
ccagacattg 
cgcaataccc 
aacgtcaaga 
gtcatcttct 
ggaactgttt 
gtcccgctgg 
ctttgggtcc 
ctatgccctc 
gaatatgatg 
ttgggaatat 
caatgtgaac 
cattaaatca 
catctcagcc 
gacaactgaa 
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attggacttc acgatttaag gaagaaaatg tcaatcatac ctcaggaacc tgttttgttc 3540 
actggaacaa tgaggaaaaa cctggatccc tttaatgagc acacggatga ggaactgtgg 3600 
aatgccttac aagaggtaca acttaaagaa accattgaag atcttcctgg taaaatggat 3660 
actgaattag cagaatcagg atccaatttt agtgttggac aaagacaact ggtgtgcctt 3720 
gccagggcaa ttctcaggaa aaatcagata ttgattattg atgaagcgac ggcaaatgtg 3780 
gatccaagaa ctgatgagtt aatacaaaaa aaaatccggg agaaatttgc ccactgcacc 3840 
gtgctaacca ttgcacacag attgaacacc attattgaca gcgacaagat aatggtttta 3900 
gattcaggaa gactgaaaga atatgatgag ccgtatgttt tgctgcaaaa taaagagagc 3960 
ctattttaca agatggtgca acaactgggc aaggcagaag ccgctgccct cactgaaaca 4020 
gcaaaacaga gatggggttt caccatgttg gccaggctgg tctcaaactc ctgacctcaa 4080 
gtgatccacc tgccttggcc tcccaaactg ctgagattac aggtgtgagc caccacgccc 4140 
agcctgagta tacttcaaaa gaaattatcc acatattggt cacactgacc acatggttac 4200 
aaacacttcc aatggacagc cctcgacctt aactattttc gagacagcac tgtgaatcca 4260 
accaaaatgt caagtccglt ccgaaggcat ttgccactag tttttggact atgtaaacca 4320 
cattgtactt ttttttactt tggcaacaaa tatttataca tacaagatgc tagttcattt 4380 
gaatatttct cccaacttat ccaaggatct ccagctctaa caaaatggtt tatttttatt 4440 
taaatgtcaa tagtkgkttt ttaaaatcca aatcagaggt gcaggccacc agttaaatgc 4500 
cgtctatcac qttttgtgcc ttaagagact acagnagtca gaagctcatt tttaaaggag 4560 
taggacagag ttgtcacagg tttttgttgg tgtttktatt gcccccaaaa ttacatgtta 4620 
atttccattt atdtcaggqg attctattta cttgaagact gtgaagttgc cattttgtct 4680 
cattgttttc tttgacatam ctaggatcca ttatttcccc tgaaggcttc ttgkagaaaa 4740 
tagtacagtt acaaocaata ggaactamca aaaagaaaaa gtttgtgaca ttgtagtagg 4S00 
gagtgtgtac cccttactcc ccatcaaaaa aaaaaatgga tacatggtta aaggatagaa 4860 
gggcaatatt ttatcatatg ttctaaaaga gaaggaagag aaaatactac tttctcaaaa 4920 
tggaagccct taaaggtgct ttgatactga aggacacaaa tgtgaccgtc catcctcctt 4980 
tagagttgca tgacttggac acggtaactg ttgcagtttt agactcagca ttgtgacact 5040 
tcccaagaag gccaaacctc taaccgacat tcctgaaata cgtggcatta ttcttttttg 5100 
gatttctcat ttaggaaggc taaccctctg ttgamtgtam kccttttggt ttgggctgta 5160 
ttgaaatcct ttctaaattg catgaatagg ctctgctaac cgtgatgaga caaactgaaa 5220 
attattgcaa gcattgacta taattatgca gtacgttctc aggatgcatc caggggttca 5280 
ttttcatgag cctgtccagg ttagtttact cctgaccact aatagcattg tcatttgggc 5340 
tttctgttga atgaatcaac aaaccacaat acttcctggg accttttgta ctttatttga 5400 
acLatgagtc ri: taatrttc cctgatgatg gtggctgtaa tatgttgagt tcagtttact 5460 
aaaggtttta ctattatggt ttgaagggag tctcatgacc tctcagaaaa ggtgcacctc 5520 
cctgaaattg catatatgta tatagacatg cacacgtgtg catttgtttg tatacatata 5580 
tttgtccttc gtatagcaag ttttttgctc atcagcagag agcaacagat gttttattga 5640 
gtgaagcctt aaaaagcaca caccacacac agctaactgc caaaatacat tgaccgtagt 5700 
agctgttcaa ctcctagtac ttagaaatac acgtatggtt aatgttcagt ccaacaaacc 5760 
acacacagta aatgtttatt aatagtcatg gttcgtattt taggtgactg aaattgcaac 5820 
agtgatcata atgaggtttg ttaaaatgat agctatattc aaaatgtcta tatgtttatt 5880 
tggacttttg aggttaaaga cagtcatata aacgtcctgt ttctgtttta atgttatcat 5940 
agaatttttt aatgaaacta aattcaattg aaataaatga tagttttcat ctccaaaaaa 6000 
aaaaaaaaag ggcggcccgc tcgagtctag agggcccggt ttaaacccgc tgatcagcct 6060 
cgactgtgcc ttctagttgc cagccatctg ttgtttggcc ctcccccgtg ccttccttga 6120 
ccctggaagg ggccactccc 6140 

<210> 537 
<211> 1228 
<212> PRT 

<213> Homo sapiens 
<400> 537 

Met Leu Pro Val Tyr Gin Glu Val Lys Pro Asn Pro Leu Gin Asp Ala 
5 10 15 

Asn Leu Cys Ser Arg Val Phe Phe Trp Trp Leu Asn Pro Leu Phe Lys 
20 25 30 
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He Glv His Lys Arg Arg Leu Giu Glu Asp Asp Met Tyr Ser Val Leu 
35 " 40 45 

Pro Glu Asp Arg Ser Gin His Leu Gly Glu Glu Leu Gin Gly Phe Trp 
50 55 60 

Asp Lys Glu Val Leu Ara Ala Glu Asn Asp Ala Gin Lys Pro Ser Leu 

65 70 75 80 

Thr Arg Ala He He Lvs Cys Tyr Trp Lys Ser Tyr Leu Val Leu Gly 
85 90 95 

He Phe Thr Leu He Giu Glu Ser Ala Lys Val He Gin Pro He Phe 
100 105 110 

Leu Gly Lys He He Asn Tyr Phe Glu Asn Tyr Asp Pro Met Asp Ser 

115 120 125 

Val Ala Leu Asn Thr Ala Tyr Ala Tyr Ala Thr Val Leu Thr Phe Cys 
130 135 140 

Thr Leu He Leu Ala He Leu His His Leu Tyr Phe Tyr His Val Gin 
145 150 155 160 

Cys Ala Gly Met Arg Leu Arg Val Ala Met Cys His Met He Tyr Arg 
165 170 175 

Lys Ala Leu Tlrg Leu Ser Asn Met Ala Met Gly Lys Thr Thr Thr Gly 
180 185 190 

Gin Tie Val Asn Leu Leu Ser Asn Asp Val Asn Lys Phe Asp Gin Val 
195 200 205 

Thr Val Phe Leu His Phe Leu Trp Ala Gly Pro Leu Gin Ala He Ala 
210 215 220 

Val Thr Ala Leu Leu Trp Met Glu He Gly He Ser Cys Leu Ala Gly 
225 230 235 240 

Met Ala Val Leu He He Leu Leu Pro Leu Gin Ser Cys Phe Gly Lys 
245 250 255 

Leu Phe Ser Ser Leu Arg Ser Lys Thr Ala Thr Phe Thr Asp 7U.a Arg 
260 265 270 

He Arg Thr Met Asn Glu Val He Thr Gly He Arg He He Lys Met 

275 280 285 

Tyr Ala Trp Glu Lys Ser Phe Ser Asn Leu He Thr Asn Leu Arg Lys 
290 295 300 

Lys Glu He Ser Lys He Leu Arg Ser Ser Cys Leu Arg Gly Met Asn 
305 310 315 320 

Leu Ala Ser Phe Phe Ser Ala Ser Lys He He Val Phe Val Thr Phe 
325 330 335 



Thr Thr Tyr Val Leu Leu Gly Ser Val He Thr Ala Ser Arg Val Phe 
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340 345 350 

Val Ala Val Thr Leu Tyr Gly Ala Val Arg Leu Thr Val Thr Leu Phe 
355 360 365 

Phe Pro Ser Ala He Glu Arg Val Ser Glu Ala He Val Ser He Arg 
370 375 380 

Arg He Gin Thr Phe Leu Leu Leu Asp Glu He Ser Gin Arg Asn Arg 
385 390 395 400 

Gin Leu Pro Ser Asp Gly Lys Lys Met Val His Val Gin Asp Phe Thr 
405 410 415 

Ala Phe Trp Asp Lys Ala Ser Glu Thr Pro Thr Leu Gin Gly Leu Ser 
420 425 430 

Phe Thr Val Arg Pro Gly Glu Leu Leu Ala Val Val Gly Pro Val Gly 
435 440 445 

Ala Gly Lys Ser Ser Leu Leu Ser Ala Val Leu Gly Glu Leu Ala Pro 
450 455 460 

Ser His Gly Leu Val Ser Val His Gly Arg He Ala Tyr Val Ser Gin 
465 470 475 480 

Gin Pro Trp Val Phe Ser Gly Thr Leu Arg Ser Asn He Leu Phe Gly 
485 490 495 

Lys Lys Tyr Glu Lys Glu Arg Tyr Glu Lys Val He Lys Ala Cys Ala 
500 505 510 

'jr- ijcu jUCU oO-U x-i^p taJ.y >i^p Ijeu ITlZ V ax Ixe 

515 520 525 

Gly Asp Arg Gly Thr Thr Leu Ser Gly Gly Gin Lys Ala Arg Val Asn 
530 535 540 

Leu Ala Arg Ala Val Tyr Gin Asp Ala Asp He Tyr Leu Leu Asp Asp 

550 555 560 

Pro Leu Ser Ala Val Asp Ala Glu Val Ser Arg His Leu Phe Glu Leu 
565 570 575 

Cys He Cys Gin He Leu His Glu Lys He Thr He Leu Val Thr His 
580 585 590 

Gin Leu Gin Tyr Leu Lys Ala Ala Ser Gin He Leu He Leu Lys Asp 
595 600 605 

Gly Lys Met Val Gin Lys Gly Thr Tyr Thr Glu Phe Leu Lys Ser Gly 
610 615 620 

He Asp Phe Gly Ser Leu Leu Lys Lys Asp Asn Glu Glu Ser Glu Gin 
625 630 635 640 

Pro Pro Val Pro Gly Thr Pro Thr Leu Arg Asn Arg Thr Phe Ser Glu 
645 650 655 
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Ser Ser Val Trp Ser Gin Gin Ser Ser Arg Pro Ser Leu Lys Asp Gly 

660 665 67C 

Ala Leu Glu Ser Gin Asp Thr Glu Asn Val Pro Val Thr Leu Ser Glu 
675 680 685 

Glu A.sn Arg Ser Glu Gly Lys Val Gly Phe Gin Ala Tyr Lys Asn Tyr 
690 695 700 

Phe Arg Ala Gly Ala His Trp lie Val Phe lie Phe Leu lie Leu Leu 
705 710 ^ 715 720 

Asn Thr Ala Ala Gin Val Ala Tyr Val Leu Gin Asp Trp Trp Leu Ser 
725 730 735 

Tyr Trp Ala Asn Lys Gin Ser Met Leu Asn Val Thr Val Asn Gly Gly 
740 745 750 

Gly Asn Val Thr Glu Lys Leu Asp Leu Asn Trp Tyr Leu Gly lie Tyr 
755 760 765 

Ser Gly Leu Thr Val Ala Thr Val Leu Phe Gly He Ala Arg Ser Leu 
770 775 780 

Leu Val Phe Tyr Val Leu Val Asn Ser Ser Gin Thr Leu His Asn Lys 
785 790 795 800 

Met Phe Glu Ser He Leu Lys Ala Pro Val Leu Phe Phe Asp Arg Asn 
805 810 815 

Pro He Gly Arg He Leu Asn Arg Phe Ser Lys Asp He Gly His Leu 
820 825 830 

Asp Asp Leu Leu Pro Leu Thr Phe Leu Asp Phe He Gin Thr Leu Leu 
835 840 845 

Gin Val Val Gly Val Val Ser Val Ala Val Ala Val He Pro Trp He 
850 855 860 

Ala He Pro Leu Val Pro Leu Gly He He Phe He Phe Leu Arg Arg 
865 870 875 880 

Tyr Phe Leu Glu Thr Ser Arg Asp Val Lys Arg Leu Glu Ser Thr Thr 
885 890 895 

Arg Ser Pro Val Phe Ser His Leu Ser Ser Ser Leu Gin Gly Leu Trp 
900 905 910 

Thr He Arg Ala Tyr Lys Ala Glu Glu Arg Cys Gin Glu Leu Phe Asp 
915 920 925 

Ala His Gin Asp Leu His Ser Glu Ala Trp Phe Leu Phe Leu Thr Thr 
930 935 940 



Ser Arg Trp Phe Ala Val Arg Leu Asp Ala He Cys Ala Met Phe Val 
945 950 ^ 955 960 
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Tie lie Val Ala Phe Gly Ser Leu lie Leu Ala Lys Thr Leu Asp Ala 
965 970 975 

Gly Gin Val Gly Leu Ala Leu Ser Tyr Ala Leu Thr Leu Met Gly Met 
980 985 990 

Phe Gin Trp Cys Val Arg Gin Ser Ala Glu Val Glu Asn Met Met lie 
995 1000 1005 

Ser Val Glu Arg Val lie Glu Tyr Thr Asp Leu Glu Lys Glu Ala Pro 
1010 1015 1020 

Trp Glu Tyr Gin Lys Arg Pro Pro Pro Ala Trp Pro His Glu Gly Val 
1025 1030 1035 1040 

lie lie Phe Asp Asn Val Asn Phe Met Tyr Ser Pro Gly Gly Pro Leu 
1045 1050 1055 

Val Leu Lys His Leu Thr Ala Leu lie Lys Ser Gin Glu Lys Val Gly 
1060 1065 1070 

lie Val Gly Arg Thr Gly Ala Gly Lys Ser Ser Leu lie Ser Ala Leu 
1075 1080 1085 

Phe Arg Leu Ser Glu Pro Glu Gly Lys lie Trp lie Asp Lys lie Leu 
1090 1095 1100 

Thr Thr Glu lie Gly Leu His Asp Leu Arg Lys Lys Met Ser lie lie 
1105 1110 1115 1120 

Pro Gin Glu Pro Val Leu Phe Thr Gly Thr Met Arg Lys Asn Leu Asp 
1125 1130 1135 

Pro Phe Asn Glu His Thr Asp Glu Glu Leu Trp Asn Ala Leu Gin Glu 
1140 1145 1150 

Val Gin Leu Lys Glu Thr lie Glu Asp Leu Pro Gly Lys Met Asp Thr 
1155 1160 1165 

Glu Leu Ala Glu Ser Gly Ser Asn Phe Ser Val Gly Gin Arg Gin Leu 
1170 1175 1180 

Val Cys Leu Ala Arg Ala He Leu Arg Lys Asn Gin He Leu He He 
1185 1190 1195 1200 

Asp Glu Ala Thr Ala Asn Val Asp Pro Arg Thr Asp Glu Leu He Gin 
1205 1210 1215 

Lys Lys Ser Gly Arg Asn Leu Pro Thr Ala Pro Cys 
1220 1225 

<210> 538 
<211> 1261 
<212> PRT 

<213> Homo sapiens 
<400> 538 

Met Tyr Ser Val Leu Pro Glu Asp Arg Ser Gin His Leu Gly Glu Glu 
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5 10 15 

Leu Gin Gly Phe Trp Asp Lys Glu Val Leu Arg Ala Glu Asn Asp Ala 
20 25 30 

Gin Lys Pro Ser Leu Thr Arg Ala lie lie Lys Cys Tyr Trp Lys Ser 

35 40 4b 

Tyr Leu Val Leu Gly lie Phe Thr Leu lie Glu Glu Ser Ala Lys Val 
50 55 60 

lie Gin Pro lie Phe Leu Gly Lys lie lie Asn Tyr Phe Glu Asn Tyr 
65 70 75 80 

Asp Pro Met Asp Ser Val Ala Leu Asn Thr Ala Tyr Ala Tyr Ala Thr 
85 90 95 

Val Leu Thr Phe Cys Thr Leu lie Leu Ala lie Leu His His Leu Tyr 
100 105 110 

Phe Tyr His Val Gin Cys Ala Gly Met Arg Leu Arg Val Ala Met Cys 
115 120 " 125 

His Met lie Tyr Arg Lys Ala Leu Arg Leu Ser Asn Met Ala Met Gly 
130 135 140 

Lys Thr Thr Thr Gly Gin lie Val Asn Leu Leu Ser Asn Asp Val Asn 
145 150 155 160 

Lys Phe Asp Gin Val Thr Val Phe Leu His Phe Leu Trp Ala Gly Pro 
165 170 175 

Leu Gin Ala He Ala Val Thr Ala Leu Leu Trp Met Glu He Gly He 
180 185 190 

Ser Cys Leu Ala Gly Met Ala Val Leu He He Leu Leu Pro Leu Gin 
195 200 205 

Ser Cys Phe Gly Lys Leu Phe Ser Ser Leu Arg Ser Lys Thr Ala Thr 
210 215 220 

Phe Thr Asp Ala Arg He Arg Thr Met Asn Glu Val He Thr Gly He 

225 230 235 240 

Arg He He Lys Met Tyr Ala Trp Glu Lys Ser Phe Ser Asn Leu He 
245 250 255 

Thr Asn Leu Arg Lys Lys Glu He Ser Lys He Leu Arg Ser Ser Cys 
260 265 270 

Leu Arg Gly Met Asn Leu Ala Ser Phe Phe Ser Ala Ser Lys He He 
275 280 285 

Val Phe Val Thr Phe Thr Thr Tyr Val Leu Leu Gly Ser Val He Thr 
290 295 300 

Ala Ser Arg Val Phe Val Ala Val Thr Leu Tyr Gly A^la Val Arg Leu 
305 310 315 320 
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Thr Val Thr Leu Phe Phe Pro Ser Ala He Glu Arg Val Ser Glu Ala 
325 330 335 

He Val Ser He Arg Arg He Gin Thr Phe Leu Leu Leu Asp Glu He 

340 345 350 

Ser Gin Arg Asn Arg Gin Leu Pro Ser Asp Gly Lys Lys Met Val His 
355 360 365 

Val Gin Asp Phe Thr Ala Phe Trp Asp Lys Ala Ser Glu Thr Pro Thr 
370 375 380 

Leu Gin Gly Leu Ser Phe Thr Val Arg Pro Gly Glu Leu Leu Ala Val 
385 390 395 400 

Val Gly Pro Val Gly Ala Gly Lys Ser Ser Leu Leu Ser Ala Val Leu 
405 410 415 

Gly Glu Leu Ala Pro Ser His Gly Leu Val Ser Val His Gly Arg He 
420 425 430 

Ala Tyr Val Ser Gin Gin Pro Trp Val Phe Ser Gly Thr Leu Arg Ser 
435 440 445 

Asn He Leu Phe Gly Lys Lys Tyr Glu Lys Glu Arg Tyr Glu Lys Val 
450 455 460 

He Lys Ala Cys Ala Leu Lys Lys Asp Leu Gin Leu Leu Glu Asp Gly 
465 470 475 480 

Asp Leu Thr Val He Gly Asp Arg Gly Thr Thr Leu Ser Gly Gly Gin 
485 490 495 

Lys Ala Arg Val Asn Leu Ala Arg Ala Val Tyr Gin Asp Ala Asp He 
500 505 510 

Tyr Leu Leu Asp Asp Pro Leu Ser Ala Val Asp Ala Glu Val Ser Arg 
515 520 525 

His Leu Phe Glu Leu Cys He Cys Gin He Leu His Glu Lys He Thr 
530 535 540 

He Leu Val Thr His Gin Leu Gin Tyr Leu Lys Ala Ala Ser Gin He 
545 550 555 560 

Leu He Leu Lys Asp Gly Lys Met Val Gin Lys Gly Thr Tyr Thr Glu 



Phe Leu Lys Ser Gly He Asp Phe Gly Ser Leu Leu Lys Lys Asp Asn 

■ 580 585 590 

Glu Glu Ser Glu Gin Pro Pro Val Pro Gly Thr Pro Thr Leu Arg Asn 

595 600 605 

Arg Thr Phe Ser Glu Ser Ser Val Trp Ser Gin Gin Ser Ser Arg Pro 



565 



570 



575 



610 



615 



620 



BNSDOCID <WO 



0151633A2 I > 



wo in/51633 

196 



PC T/l S01/(M574 



Ser Leu Lys Asp Gly Ala Leu Glu Ser Gin Asp Thr Glu Asn Val Pro 
625 630 635 640 

Val Thr Leu Ser Glu Glu Asn Arg Ser Glu Gly Lys Val Gly Phe Gin 
645 ^ 650 655 

^.la Tyr Lys Asn Tyr Phe Arg Ala Gly Ala His Trp lie Val Phe lie 
660 665 670 

Phe Leu lie Leu Leu Asn Thr Ala Ala Gin Val Ala Tyr Val Leu Gin 
675 680 685 

Asp Trp Trp Leu Ser Tyr Trp Ala Asn Lys Gin Ser Met Leu Asn Val 
690 ^ 695 700 

Thr Val Asn Gly Gly Gly Asn Val Thr Glu Lys Leu Asp Leu Asn Trp 
705 710 715 720 

Tyr Leu Gly He Tyr Ser Gly Leu Thr Val Ala Thr Val Leu Phe Gly 

725 730 735 

Tie Ala Arg Ser Leu Leu Val Phe Tyr Val Leu Val Asn Ser Ser Gin 
740 745 750 

Thr Leu His Asn Lys Met Phe Glu Ser He Leu Lys Ala Pro Val Leu 
755 760 765 

Phe Phe Asp Arg Asn Pro He Gly Arg He Leu Asn Arg Phe Ser Lys 
770 775 780 

Asp He Gly His Leu Asp Asp Leu Leu Pro Leu Thr Phe Leu Asp Phe 
785 790 795 800 

rie Gin Thr Leu Leu Gin Val Val Gly Val Val Ser Val Ala Val Ala 
805 810 815 

Val He Pro Trp He Ala He Pro Leu Val Pro Leu Gly He He Phe 
820 825 830 

He Phe Leu Arg Arg Tyr Phe Leu Glu Thr Ser Arg Asp Val Lys Arg 
835 840 845 

Lou Glu Ser Thr Thr Arg Ser Pro Val Phe Ser His Leu Ser Ser Ser 

850 855 860 

Leu Gin Gly Leu Trp Thr He Arg Ala Tyr Lys Ala Glu Glu Arg Cys 

865 870 875 880 

Gin Glu Leu Phe Asp Ala His Gin Asp Leu His Ser Glu Ala Trp Phe 
885 890 895 

Leu Phe Leu Thr Thr Ser Arg Trp Phe Ala Val Arg Leu Asp Ala He 

900 905 910 

Cys Ala Met Phe Val He He Val Ala Phe Gly Ser Leu He Leu Ala 
915 920 925 

Lys Thr Leu Asp Ala Gly Gin Val Gly Leu Ala Leu Ser Tyr Ala Leu 
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Thr Leu Met Gly 
945 

Glu Asn Met Met 

Glu Lys Glu Ala 
980 



935 

Met Phe Gin Trp 
950 

He Ser Val Glu 
965 

Pro Trp Glu Tyr 



940 

Cys Val Arg Gin 

955 

Arg Val He Glu 
970 

Gin Lys Arg Pro 
985 



Ser Ala Glu Val 
960 

Tyr Thr Asp Leu 

975 

Pro Pro Ala Trp 
990 



Pro His Glu Gly Val He He Phe Asp Asn Val Asn Phe Met Tyr Ser 
995 1000 1005 

Pro Gly Gly Pro Leu Val Leu Lys His Leu Thr Ala Leu He Lys Ser 
1010 1015 1020 

Gin Glu Lys Val Gly He Val Gly Arg Thr Gly Ala Gly Lys Ser Ser 
1025 1030 1035 1040 

Leu He Ser Ala Leu Phe Arg Leu Ser Glu Pro Glu Gly Lys He Trp 
1045 1050 1055 

He Asp Lys He Leu Thr Thr Glu He Gly Leu His Asp Leu Arg Lys 
1060 1065 1070 

Lys Met Ser He He Pro Gin Glu Pro Val Leu Phe Thr Gly Thr Met 
1075 1080 1085 

Arg Lys Asn Leu Asp Pro Phe Asn Glu His Thr Asp Glu Glu Leu Trp 
1090 1095 1100 

Asn Ala Leu Gin Glu Val Gin Leu Lys Giu Thr He Glu Asp Leu Pro 
1105 1110 1115 1120 

Gly Lys Met Asp Thr Glu Leu Ala Glu Ser Gly Ser Asn Phe Ser Val 
1125 1130 1135 

Gly Gin Arg Gin Leu Val Cys Leu Ala Arg Ala He Leu Arg Lys Asn 
1140 1145 1150 

Gin He Leu He He Asp Glu Ala Thr Ala Asn Val Asp Pro Arg Thr 
1155 1160 1165 

Asp Glu Leu He Gin Lys Lys He Arg Glu Lys Phe Ala His Cys Thr 
1170 1175 1180 

Val Leu Thr He Ala His Arg Leu Asn Thr He He Asp Ser Asp Lys 
1185 1190 1195 1200 

He Met Val Leu Asp Ser Gly Arg Leu Lys Glu Tyr Asp Glu Pro Tyr 
1205 1210 1215 

Val Leu Leu Gin Asn Lys Glu Ser Leu Phe Tyr Lys Met Val Gin Gin 
1220 1225 1230 

Leu Gly Lys Ala Glu Ala Ala Ala Leu Thr Glu Thr Ala Lys Gin Arg 
1235 1240 1245 



BNSDOCID <rWO 0151633A2 I 



wo 01/51633 PC T/l SOl/01574 

198 



Trc Gly Phe Thr K^t Leu Ala Arg Leu Val Ser Asn Ser 
1250 1255 ^ 1260 

<210> 539 
:2i:> 10 
-212> PRT 

'21 3> Artificia.] Sequence 
v:220> 

<22 3> Made in a lab 
<400> 539 

Cys Leu Scr His Ser Val Ala Val Val Thr 
1 5 10 

<210> 540 
<211> 9 
<212> PRT 

<213> Artiticidl Sequence 

<220> 

':223> Made ma luo 
v400> 540 

Ala Val Val Thr Al<^ Ser Ala Ala Leu 
1 5 

<210> 541 
<211> 14 
<212> PRT 

<213> Homo sapiens 
:400> 541 

Leu Ala Gly Leu Leu Cys Pro Asp Pro Arg Pro Leu Glu Leu 
5 10 



<210> 542 
<211> 15 
<212> PRT 

<213> Homo sapiens 
<400> 542 

Thr Gin Val Val Phe Asp Lys Ser Asp Leu Ala Lys Tyr Ser Ala 
5 10 15 



.210> 54 3 
<211> 12 
<212> PRT 

<213> Homo sapiens 
<400> 543 

Phe Met Gly Ser He Val Gin Leu Ser Gin Ser Val 
5 10 
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<210> 544 

<211> 18 

<212> PRT 

<213> Homo sapiens 

<400> 544 

Thr Tyr Val Pro Pro Leu Leu Leu Glu Val Gly Val Glu Glu Lys Phe 
5 10 15 

Met Thr 



<210> 545 

<211> 18 

<212> PRT 

<213> Homo sapiens 

<400> 545 

Wet Asp Arg Leu Val Gin Arg Phe Gly Thr Arg Ala Val Tyr Leu Ala 
5 10 15 

Ser Val 



<210> 546 

<211> 29 

<212> PRT 

<213> Homo sapiens 

<400> 546 

Phe Val Gly Glu Gly Leu Tyx Gin Gly Val Pro Arg Ala Glu Pro Gly 
5 10 15 

Thr Glu Ala Arg Arg His Tyr Asp Glu Gly Val Arg Met 
20 25 



<210> 547 

<211> 58 

<212> PRT 

<213> Homo sapiens 

<400> 547 

Val Ala Glu Glu Ala Ala Leu Gly Pro Thr Glu Pro Ala Glu Gly Leu 
S 10 15 

Ser Ala Pro Ser Leu Ser Pro His Cys Cys Pro Cys Arg Ala Arg Leu 
20 25 30 

Ala Phe Arg Asn Leu Gly Ala Leu Leu Pro Arg Leu His Gin Leu Cys 
35 40 45 

Cys Arg Met Pro Arg Thr Leu Arg Arg Leu 
50 55 
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<.:iJO> 548 
<211> 18 
^212^' PRT 

-2i3> Homo sapiens 
■:400> 548 

lie Asp Trp Asp Thr Ser Ala Leu Ala Pro Tyr Leu Gly Thr Gin Glu 
5 10 15 

GJ u Cys 



<210> 54 9 
<211> 18 
<212> PRT 

<213> Homo sapiens 
-400> 549 

Leu Glu Ala Leu Leu Ser Asp Leu Phe Arg Asp Pro Asp His Cys Arg 
5 10 15 

Gin Ala 



<210> 550 

<211> 14 

<212> PRT 

<213> Homo sapiens 

<400> 550 

Ser Asp His Trp Arg Gly Arg Tyr Gly Ajrg Arg Arg Pro Phe 
5 10 



<210> 551 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<22C> 

<223> Made in a lab 
<400> 551 

Phe Asp Lys Ser Asp Leu Ala Lys Tyr Ser Ala 
5 ' 10 

<210> 552 

<211> 2577 

<212> DNA 

<213> Homo sapiens 

v400> 552 

agcatatgta acatgacctg tgcttcagtg ttcttttgtg atcaaaaatt ccttactttt 60 

agltttttat ctatggtaga accacccaga gcaggggtcc tcaactccca ggccacagac 120 

tcataccagt ccacggacta ttatgaacca caccacacag gaggaggtqa gcactaggca 180 

agccaaggaa gcttcacctg tacttacagc cacacgccat ggctcatatt acagcctgaa 240 
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ctctgcctcc actcagatca gtgataacat tagaaactca ttggagcacg aaccctgttg 300 
tgaactgcct atccgaagga tctaggttgt gtgcttcgta tgagaatcta atgccagatg 360 
atctatcatt gtctcacttt gcccccagat aagaccatct agttgcagaa aaataagctc 420 
agagcttcca ctgattctac attatggata tgtgccgccg aagcaagcac aaagccctac 480 
ttttacacat gcctagtgat gcttcatgga caaggcttgg ctctgttgag tccaactaac 540 
ctacctgaga ttctgagatt tctcttcaat ggcttcctgt gagctagagt ttgaaaatat 600 
cttaaaatct tgagctagag atggaagtag cttggacgat tttcattatc atgtaaatcg 660 
ggtcactcaa ggggccaacc acagctggga gccactgctc aggggaaggt tcatatggga 720 
ctttctactg cccaaggttc tatacaggat ataaaggtgc ctcacagtat agatctggta 780 
gcaaagaaga agaaacaaac actgatctct ttctgccacc cctctgaccc tttggaactc 840 
ctctgaccct ttagaacaag cctacctaat atctgctaga gaaaagacca acaacggcct 900 
caaaggatct cttaccatga aggtctcagc taattcttgg ctaagatgtg ggttccacat 960 
taggttctga atatgggggg aagggtcaat ttgctcattt tgtgtgtgga taaagtcagg 1020 
atgcccaggg gccagagcag ggggctgctg ctttgggaac aatggctgag catataacca 1080 
taggtatggg aacaaaaaac atcaaagtca ctgtatcaat tgccatgaag actcgaggga 1140 
cctgaatcta ccgattcatc ttaaggcagc aggaccagtt tgagtggcaa caatgcagca 1200 
gcagaatcaa tggaaacaac agaatgattg caatgtcctt ttttttctcc tccttctgac 1260 
ttgataaaag qjaccgtctt ccttggattt agtgaacccc tttggttcct gaaaaattca 1320 
aggagtatct aggacatagt ccccagaaga cagtacaaga ctttctgata aactggacat 1380 
ttcaagrccc a^ataactaa tcagaaaaat caaagatgtg atactatttt ttatcccatg 1440 
cataggtgct acacttggat caaatgaaca atgttgggat ctytatggat aaaggtctta 1500 
aaagtcctga gataaaqaat cctgcaccca ctggtacttc taacttgtct tgttttttgt 1560 
ctgwtttctg gctgatgcag gggactaact cactgccacg cgaaaactac ctgaactgaa 1620 
ctatgacatc tcacctgata tgtaagatgt aactgttata attattttaa acctcaattt 1680 
agcattaact agccttttaa tgtaaacact tacacattat gaygactaga aacagcatac 1740 
tctctggccg tctgtccaga tagatcttga gaagatacat caatgttttg ctcaagtaga 1800 
aggctgacta tacttgccga tccacaacat acagcaagta tgagagcagt tctaaaatga 1860 
cagagatagg aacagtaata aagttattkt aaaagctaat ttgatatact ttaccaattt 1920 
aacatcttgc ctgtccgtgc agaatcaaac atttacatgc actaaaagac ataagcatct 1980 
tcagtgctca agtgttcatc tttgtaaaat accaccaagg ttaaaaggaa gggacaaaaa 2040 
aaaaaaaccc tcttatctca gtggggtatt gcatagcaga agctactaat ttgaagtcct 2100 
ttgatggaca agaaacaata ttagggccac ttatctgaaa tgaacaaaga tttaagtgaa 2160 
gatttcatca cagcttccct agactgatat gctgtaatag aaaatcagct agggggtaaa 2220 
ataaataaga gctctctgca tgctgaaagc aagtaagatt aataataatg gtaagaatag 2280 
tagtcacagg agtttcagtt aatgatgcca ataagcatgt gctaggcact gaattaaatg 2340 
ccacatatat ctttcttatg cgcagcaaac tttgaaggat atattctcct acttttcata 2400 
tatgacaaca tatttggtgg taaataacgt tcccaaggtc acacacctag caagtaagaa 2460 
agttaggaat taaacccagt attgtgtgaa tctaaagcct aacttttttc tctttatcac 2520 
ccacctacgg cttgtcttca ttaaaggaaa agtgtatcca cttaaaaaaa aaaaaaa 2577 

<210> 553 
<211> 58 
<212> PRT 

<213> Homo sapiens 
<400> 553 

Ser He Cys Asn Met Thr Cys Ala Ser Val Phe Phe Cys Asp Gin Lys 
S 10 15 

Phe Leu Thr Phe Ser Phe Leu Ser Met Val Glu Pro Pro Arg Ala Gly 
20 25 30 

Val Leu Asn Ser Gin Ala Thr Asp Ser Tyr Gin Ser Thr Asp Tyr Tyr 
35 40 45 

Glu Pro His His Thr Gly Gly Gly Glu His 
50 55 
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<210> 5b4 
-.211 > 5 9 
<212> PRT 

<2i?> Hono sapiens 

<40Q> 554 

Leu Gin Lys Asn Lys Leu Arg A.la Ser Thr Asp Ser Thr Leu Trp He 
5 10 15 

Cys Ala Ala Glu Ala Ser Thr Lys Pro Tyr Phe Tyr Thr Cys Leu Val 
20 25 30 

Kot Leu His Gly Gin Gly Leu Ala Leu Leu Ser Pro Thr Asn Leu Pro 
35 40 45 

Glu He Leu Arg Phe Leu Phe Asn Gly Phe Leu 

50 55 



<210> 555 
<211> 71 
<212> PRT 

<213> Homo sapiens 
<400> 555 

Leu Gly Arg Phe Ser Leu Ser Cys Lys Ser Gly His Ser Arg Gly Gin 
5 10 15 

Pro Gin Leu Gly Ala Thr Ala Gin Gly Lys Val His Met Gly Leu Ser 
20 25 30 

Thr Ala Gin Gly Ser He Gin Asp He Lys Val Pro His Ser He Asp 
35 40 45 

Leu Val Ala Lys Lys Lys Lys Gin Thr Leu He Ser Phe Cys His Pro 

50 55 60 

Ser Asp Pro Leu Glu Leu Leu 

70 



65 




<210> 


556 


<211> 


81 


<212> 


PRT 


<213> 


Homo sapiens 


<400> 


556 


Asn His Pro Glu Gin 



5 10 15 

Ser Pro Arg Thr He Met Asn His Thr Thr Gin Glu Glu Val Ser Thr 
20 25 30 

Arg Gin Ala Lys Glu Ala Ser Pro Val Leu Thr Ala Thr Ajrg His Gly 
35 40 45 

Ser Tyr Tyr Ser Leu Asn Ser Ala Ser Thr Gin He Ser Asp Asn He 
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50 



55 



60 



Arg Asn Ser Leu Glu His Glu Pro Cys Cys Glu Leu Pro He Arg Arg 
65 70 75 80 



He 



<210> 557 
<211> 54 
<212> PRT 

<213> Homo sapiens 
<400> 557 

Ser Leu Ser Ala Thr Pro Leu Thr Leu Trp Asn Ser Ser Asp Pro Leu 
5 10 15 

Glu Gin Ala Tyr Leu He Ser Ala Arg Glu Lys Thr Asn Asn Gly Leu 
20 25 30 

Lys Gly Ser Leu Thr Met Lys Val Ser Ala Asn Ser Trp Leu Arg Cys 
35 40 45 

Gly Phe His He Arg Phe 
50 



<210> 558 
<211> 77 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> VARITUSIT 

<222> (1) . . . (77) 

<223> Xaa = Any amino acid 

<400> 558 

Asn Asp Arg Asp Arg Asn Ser Asn 

5 

He Tyr Phe Thr Asn Leu Thr Ser 
20 

Phe Thr Cys Thr Lys Arg His Lys 
35 40 

Leu Cys Lys He Pro Pro Arg Leu 
50 55 

Pro Ser Tyr Leu Ser Gly Val Leu 
65 70 



Lys Val He Xaa Lys Ala Asn Leu 
10 15 

Cys Leu Ser Val Gin Asn Gin Thr 
25 30 

His Leu Gin Cys Ser Ser Val His 
45 

Lys Gly Arg Asp Lys Lys Lys Lys 
60 

His Ser Arg Ser Tyr 
75 



<210> 559 
<211> 50 
<212> PRT 
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<213> Homo sapiens 
<4C0> 559 

Thr Leu Pro Pro Leu Arg Ser Val He Thr Leu Glu Thr His Trp Ser 
5 10 15 

Thr Asn Pro Val Val Asn Cys Leu Ser Glu Gly Ser Arg Leu Cys Ala 
20 25 30 

Ser Tyr Glu Asn Leu Met Pro Asp Asp Leu Ser Leu Ser His Phe Ala 
35 40 45 

Pro Arg 
50 



<21Q> 560 

<211> 56 

<212> PRT 

<213> Homo sapiens 

<400> 560 

He Gly Ser Leu Lys Gly Pro Thr Thr Ala Gly Ser His Cys Ser Gly 
5 10 15 

Glu Gly Ser Tyr Gly Thr Phe Tyr Cys Pro Arg Phe Tyr Thr Gly Tyr 
20 25 30 

Lys Gly Ala Ser Gin Tyr Arg Ser Gly Ser Lys Glu Glu Glu Thr Asn 
35 40 45 

Thr Asp Leu Phe Leu Pro Pro Leu 
50 55 



<210> 561 

<211> 57 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> VARIANT 

<222> (1) ... (57) 

<223> Xaa = Any amino acid 

<400> 561 

Val Leu His Leu Asp Gin Met Asn Asn Val Gly He Xaa Met Asp Lys 
5 10 15 

Gly Leu Lys Ser Pro Glu He Lys Asn Pro Ala Pro Thr Gly Thr Ser 
20 25 30 

Asn Leu Ser Cys Phe Leu Ser Xaa Phe Trp Leu Met Gin Gly Thr Asn 
35 40 45 

Ser Leu Pro Arg Glu Asn Tyr Leu Asn 
50 55 
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<210> 562 
<211> 59 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> VARIANT 

<222> (1) . . . (59) 

<223> Xaa = Any amino acid 

<400> 562 

Asp Leu Tyr Pro Xaa Arg Ser Gin His Cys Ser Phe Asp Pro Ser Val 
5 10 15 

Ala Pro Met His Gly lie Lys Asn Ser lie Thr Ser Leu lie Phe Leu 
20 25 30 

lie Ser Tyr Leu Xaa Leu Glu Met Ser Ser Leu Ser Glu Ser Leu Val 
35 40 45 

Leu Ser Ser Gly Asp Tyr Val Leu Asp Thr Pro 
50 55 



<210> 563 
<211> 79 
<212> PRT 

<213> Homo sapiens 
<400> 563 

Cys Phe Leu Phe Pro Tyr Leu Trp Leu Tyr Ala Gin Pro Leu Phe Pro 

Lys Gin Gin Pro Pro Ala Leu Ala Pro Gly His Pro Asp Phe lie His 
20 25 30 

Thr Gin Asn Glu Gin lie Asp Pro Ser Pro His lie Gin Asn Leu Met 
35 40 45 

Trp Asn Pro His Leu Ser Gin Glu Leu Ala Glu Thr Phe Met Val Arg 
50 55 60 

Asp Pro Leu Arg Pro Leu Leu Val Phe Ser Leu Ala Asp lie Arg 
65 70 75 



<210> 564 
<211> 64 
<212> PRT 

<213> Homo sapiens 
<400> 564 

Ala Cys Ser Lys Gly Ser Glu Glu Phe Gin Arg Val Arg Gly Val Ala 
5 10 15 

Glu Arg Asp Gin Cys Leu Phe Leu Leu Leu Cys Tyr Gin lie Tyr Thr 
20 25 30 
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Val Arg His Leu 

35 

His Met Asn Leu 
50 



Tyr lie Leu Tyr 
40 

Pro Leu Ser Set 
55 



Arg Thr Leu Gly 

Gly Ser Gin Leu 
60 



Ser Arg Lys Ser 
45 

Trp Leu Ala Pro 



-^;210> 565 
•':21i> 57 

L. ' PRT 
<213> Hottlo sapiens 

<220> 

<221> VARIANT 

<222> {1) . . . (57) 

<223> Xaa = Any amino acid 

<400> 565 

Leu Tyr Tyr Cys Ser Tyr Leu Cys 
5 

Ala Val Cys Cys Gly Ser Ala Ser 
20 

Asn lie Asp Val Ser Ser Gin Asp 
35 40 

Tyr Ala Val Ser Ser Xaa His Asn 
50 55 



His Phe Arg Thr Ala Leu He Leu 
10 15 

He Val Ser Leu Leu Leu Glu Gin 
25 30 

Leu Ser Gly Gin Thr Ala Arg Glu 
45 

Val 



-210> 566 
<211:^^ 55 
<212> PRT 

<2]3> Homo sapiens 
<400> 566 

Tie Leu Leu Glu Phe Phe Arg Asn Gin Arg Gly Ser Leu Asn Pro Arg 
5 10 15 

Lys Thr Val Pro Phe He Lys Ser Glu Gly Gly Glu Lys Lys Gly His 
20 25 30 

Cy.r. Asn His Ser Val Val Ser He Asp Ser Ala Ala Ala Leu Leu Pro 
35 40 45 

Leu Lys Leu Val Leu Leu Pro 

50 55 



<210> 567 
-:2H> 51 
<.212> PRT 

<213> Homo sapiens 

''4 00> 567 

Tyr Ser Asp Phe Asp Val Phe Cys Ser His Thr Tyr Gly Tyr Met Leu 
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Ser His Cys Ser Gin Ser Ser Ser 
20 

Leu Thr Leu Ser Thr His Lys Met 
35 40 

Phe Arg Thr 
50 



10 15 

Pro Leu Leu Trp Pro Leu Gly lie 
25 30 

Ser Lys Leu Thr Leu Pro Pro lie 
45 



<210> 568 
<211> 75 
<212> PRT 

<213> Homo sapiens 
<400> 568 

Lys Val Gly Glu Tyr He Leu Gin 
5 

Tyr Val Ala Phe Asn Ser Val Pro 
20 

Thr Glu Thr Pro Val Thr Thr He 
35 40 

Cys Phe Gin His Ala Glu Ser Ser 
50 55 

Phe Leu Leu Gin His He Ser Leu 
65 70 



Ser Leu Leu Arg He Arg Lys He 
10 15 

Ser Thr Cys Leu Leu Ala Ser Leu 
25 30 

Leu Thr He He He Asn Leu Thr 
45 

Tyr Leu Phe Tyr Pro Leu Ala Asp 
60 

Gly Lys Leu 
75 



<210> 569 
<211> 4809 
<212> DNA 

<213> Homo sapiens 
<400> 569 

gcatccagag tggtggactg gttacaggct 
cagagtccac rggttatgtt ggttcacatt 
ggacagatgt ccgataatcc tttttacatt 
aatggacttg cttcaaagtg gaggcaggca 
tttcagttgc ttttctaatt ctctcttatc 
ttccttttaa aatccttgtc tactttgcag 
gcacctactg actacacggt taggagtgca 
ggtctgtagc acccagcaaa gtgctcagta 
aaatacacac tgtactaaga atctacacac 
ctacaggtgt cacgcttggc atcacacatg 
gagctctgcc actgcatgtc cactagtgac 
cgctgctctc acataagggg tccaattaaa 
acttctcaag agctaagaag gtttgctgag 
actgctggcc aaaaatgatg agtatttccc 
gtccatcaca ttcatcattc atcagtctgg 
atgattggct gcacacttcc agactgatga 
cacatcttca gcaagagggg gaaatactca 



atgaacctac actgatgcgg caccaccacc 60 
tactcttgct gtggtatggt ctataggttt 120 
ttggcatcct tgggtagctc gtcttgtagg 180 
gatccttcag acgggtatat ggagccctgt 240 
gtttacctca aaatcttcct gaggtctcgc 300 
catcactctg acactcccat tgattcctca 360 
agggtagaat tcatgtttta ttcatctttg 420 
aatgcgcagt aattgatttg acctctgaac 480 
cgaaagacaa aaacaagaca aatttgagtg 540 
tgcctgtgta ttcctctagg tggttaccag 600 
gggttcgctc caccacccca gctgggtagc 660 
attgccagga ataaattccc ccggactttg 720 
tattctggca tgatgtttgg tgatcaaaca 780 
cctcttgctg aagatgtgct ccatacaata 840 
aagtgtgcag aacaacatgt aatagataat 900 
atgatgaatg tgatggacta ttgtatggag 960 
tcattttatc tattacatgt tgttctggtt 1020 



BNSCXX^ID: <WO 0151633A2 I_> 



wo 01/51633 PC T/l S(I1/<I1574 

208 



: tttttttrt 
at aagct tgc 
'I'.'aacicaat g 
I ctt taatct 
t ccacatact 
cgcagt ctga 
ctcatcacaa 
gcttattctt 
ggaaaagggc 
aacatgctat 
jgaacaa 
aaa t caat cc 
Htrt gaagaa 
act ttgaaat 
ggcctcaaag 
tcctgcaggc 
aaagagcaaa 
cctctaaatg 
aaattagcac 
ctagtgtgtg 
cagaaactgg 
attcctttgt 
Laaattacaa 
agcagaacga 
aggaggttga 
tgtggct cca 
ggttgaaggc 
taaattcctc 
Gcccccaccc 
agcctcatcc 
gccagtcatg 
gagcgaacag 
rial tgtttaa 
ttggacactg 
atacctgcaa 
aggctaagtg 
aagtttctac 
ggagtatttg 
acctacaggt 
cagcacagca 
tttggaaggg 
cattttacat 
catgattctc 
gggcaaatta 
ucttttgaat 
gttcctcctg 
atctacccca 
cctat aatgg 
ccagtgcctg 
ttagcaaggc 
cct ggagaac 
atcaacccaa 
.J t ttcctaga 
gcagaaaact 
c^jgi^gtctaa 
at gagat gca 
g g t g c c a g c t 
accactgtct 



tccaatgtcc 
caataaagag 
agggcactat 
t ccccagcaa 
t ttccccaac 
aaacagtaat 
ctccagccct 
gtgcttcat t 
tcagattcag 
gatgt tgggt 
ctaggccttt 
ttctaccat c 
acaaacatga 
agttgtaaga 
gacaggcagc 
tccctgctcc 
aataactttg 
tcaaaatgtt 
tgggcttata 
atgcttttt c 
tcaaaggtgt 
ggagtctggt 
aaaccctcca 
grtggatgta 
gaggtcctgt 
gtct gagtct 
aggagaagat 
cttcctccac 
ccacactagg 
caaaacactc 
ctgacacaca 
aaacattttc 
acctagt ct g 
atgactttat 
ctgagctggc 
caaaggaaag 
ttgagcagct 
ggataaagga 
aacgagacag 
gctgacatcc 
aaaatgggta 
ttatgcagtt 
aatcctttgt 
gagccttcca 
tagttttcca 
gaaacactcc 
ttactgcatg 
ggatctttca 
taggtgtgca 
acctgacaaa 
agctt ctact 
agggtggt cc 
tttggtttca 
ggtgaactag 
ggtatgcgtg 
aagggcagtg 
Lgaggai_gcL_ 
gttacctcac 



agcct aaact 
tcctctgtgg 
Tiggaaacata 
tccttgcttt 
cgtaacaaat 
t gt cct tggg 
gccct ccatg 
cctggtaagc 
ccgaccattg 
cccctt ctca 
tactccacca 
catgcatagc 
tttggtcctt 
gttaagaaaa 
aggatt ctac 
aagct cctag 
tacttgatgc 
tgcct ccttt 
ttgagtcaca 
tccaaactat 
ggcatacaca 
ttggtcttgg 
catttctt ca 
tgttttggaa 
gctctgccat 
gaaggcccaa 
ggatgtccca 
cttttgttcc 
gagggccatc 
tccagacaca 
gaactaacca 
ataattttca 
gccacacatt 
gcccagatta 
aaaaagaaaa 
aaaaacacca 
agaagactac 
ggaatctgat 
ctggacaaac 
tgtgtgacag 
caatgttgtc 
gatatattta 
tattttgtat 
tattgccaag 
gttcacacct 
ctaacaagca 
tactcaagag 
ctcaccctaa 
attaacctca 
gtcct t cagg 
gccagatgat 
ctgatggccc 
gactt gctcc 
aaaaggcctg 
ggagcctgca 
gacaaggagc 
gggcaagtcc 
gagttcaagt 



ataaagtact 
tatggaact g 
ctgtgctgca 
gt gcgcatt t 
tatttaactc 
aagaagtgag 
t ggtagcagc 
tcaatttctt 
tggcctctgt 
tccccttggg 
tccctttgtt 
aatttctaaa 
ttagtaaaca 
agcacaaaac 
agggtttgag 
ctaacagccc 
tgagtcagtg 
gaggcttctc 
gatctgggcc 
tgat acaaaa 
ttctcatgta 
ttttcttggt 
tggattgtat 
gagattatga 
ctgcaagctg 
gaaccagggg 
gctcagcagg 
attcaggcct 
tgctttactg 
cgcagaaatg 
tgacatggat 
attattttta 
attt cctggg 
cactggaggc 
ctaggcaagt 
actgcaggga 
aatgccactc 
gttggaggtc 
acatggaact 
ccttgaaagc 
tgccactttg 
tgtttcctgg 
tacaaaaagc 
gtgtatcaac 
accatttatt 
cccttgcagg 
tcagctttta 
agtgaggaca 
gaccaaggaa 
atgtttttgt 
cttactcaaa 
accagcccct 
tcct gcagac 
tgtgggtcac 
cagcaggagc 
caagggaggt 

gctt cacgtg 



tt gagaacGc 
gctt atttca 
cagagcatrt 
atgatccttg 
atct aatgt a 
tttaagagag 
t ctttggact 
taccttagga 
ggctgtcaca 
atttcttctg 
ttcttttgtt 
aactgaattt 
gaataaattt 
tgagatcatc 
ccttcctaag 
cttctcccac 
taaaaagcca 
tcctcctact 
ctgccacaga 
tgcactggaa 
gatgtaaagc 
gtttgattca 
tagtccatgt 
ggaaccggct 
aagacctgga 
aaccaacggt 
caggcaggaa 
tcaacagatt 
agtcggctga 
tttcatctgg 
tcttcttaaa 
atgaaaacta 
accgcccctc 
ctgtgctgat 
atgacagata 
tgagggactc 
atcaaaacag 
aaatttgaag 
caggacaaag 
agcaggcccg 
gggccttctt 
gtcttttata 
tgaattatta 
cacactgata 
tcatgattgg 
aatgaagaca 
tatgatctct 
aaatacttga 
gtgccgaacg 
acatgagcta 
agatgcagat 
gtgcctggct 
actccct aac 
gtggccaccc 
ggggt cttct 
ggctctagtc 
cjccttcctag 
agacagctac 



acagtgagcc 
tacacaat ct 
acaccgctta 
ctctcagaag 
tgt atgtccg 
ctctagggca 
ggggct aagt 
taacttt get 
gcttgtccct 
ctggcccaca 
t cgttggtaa 
caagagcagt 
taataaatca 
agagcagctt 
tgaagctgtt 
gattggcaac 
taaaaaattc 
gggtctggat 
gagctt cctc 
t agaaatcaa 
tgt gcttaga 
tttttttacg 
t ct ccagaga 
catgtgatga 
aagctgaggg 
gtagattcca 
gcaaatgggg 
ggatgagcgc 
gtcaagtgcc 
gcaccctgtg 
gcagtgatag 
tatctgatgg 
cttcaatccc 
tttctaacac 
cat gat gcac 
accccttt ag 
tgactcaggg 
tgtctttaag 
gctctaagga 
ccgctcacat 
gggtcacatg 
cattagacac 
tttcaaatat 
ycaygatctc 
tttcagactt 
caccacacac 
cccaagtgct 
aagcatgagc 
catctggctt 
gagaaatgta 
taagcaaaat 
cgtttcct at 
cagcat cctt 
aacaccacag 
ggagacccgc 
acgctggtat 
taccacagtt 
gagacaggcc 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4 4 4 0 
4500 
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cctggaaact ggaaaatgcy aagtaaatgt catgcacaat tgttgttcac attttatctc 4560 

aatcactttt accaaatcag gctaaaccct gggtattcat aacgtcttgg gctgtacaaa 4 620 

ttgttccttg aaatgactca gagacatttt ctgaattggc ttccatcagc caagcatttc 4 680 

ttcagaactg gaaaaatgct ttaaatttgg ctttgtcatg attattaaaa cactctgtac 4 740 

attttttatt attgaaatta acacattgcc tactttttaa aaattggaaa aagaaaaaaa 4 800 

aaaaaaaaa 4809 

<210> 570 

<211> 951 

<212> DNA 

<213> Homo sapiens 

<400> 570 

aaaattgaat attgagatac cattctttag tgttaccttt tttacccaca tgtgtttctg 60 
aaaatattgg aattttattc atcttaaaaa ttggacccgg ccttatttac catctttaat 120 
ccattttagt actatgggtg agtacatgga attgaagtct ggcttaaatc ttcagaaagt 180 
tatatatcta ttttatttta tttttttgag acagagtctc gctgtgtcac ccaggctgga 240 
gtgcggtgcc acaatcttgg ctcactgcaa cctctgagtc ccaggttcaa gcgatactca 300 
tgcctcggcc tcctgagtag ctgggactac aggcgtgcac caccacatct ggctaatctt 3 60 
tttttgtatt tttagtagag acggggtttc actgtggtct ccatctcctg acctcgtgat 420 
ccgcctgcct cccaaagtgc tgggattaca ggcatgagcc accgcacaca gctgggactg 4 80 
ggtaatttat aaagaaaaga ggtttaatga ctcacagttc cgcatggctg gagaggcctc 540 
aggaaactta caatcatggt ggaaggcgaa ggggaagcaa ggcacgtctt acatggtggc 600 
aggagagaac gagtgagggg ggagactgcc acaaactttt tttttttgag acaagagtct 660 
ggccctgttg cccaggctgg agtgcagtgg catgatctca gctcactgca acctctgcct 720 
cacaggttca agcaattctc atgcctcagc ctcccgcata gctgggacca caggtatgca 7 80 
ccaccacacc tagctaattt ttgtagtttt agtagagatg gggtctcact atgttgctca 840 
ggctggtcta aaactcctgg gctccagcaa tccgcctgcc ttggcctccc aaagtgctgg 900 
ggttacaggc ataagccacc acatccagcc tgccacatac ttttaaacta t 951 

<210> 571 
<211> 819 
<212> UNA 
<213> Homo sapiens 

<400> 571 

cagcttaaaa atggtttctt gaaatcagtg attagcattc actcaccagt acccctacta 60 
aggggtaggc actggtttgt actcctggga atacaggagt acaccagaat ttatttctgc 120 
ttattgcttt tgttgcaaat gccgtggctt catctgagga attctagaat tcagagggtg 180 
tagccctcca ctctgctgtc ttgctatctg ctctcattgc atccgtttaa cctgcattct 240 
gaaagatgtt tctcaggttt ttccttgacg attttcttct tttctgattc tgacaatgtt 300 
ttaaatcatt gtactgtggt tatcatttct ctgcatttat tttacccatc ttcctttgta 360 
acttgtccta ttgtctttta atttctgcct gttctttatg gctttcaact tcataaataa 420 
catgttttct caaatctctt tgtgaattcc agagagggcc aggcacggtg gctcacatct 480 
gtaatcccag cactttgggg aggctgagac gggtggatca cttgaggtca ggagtttgag 540 
accagcctgg ccaacatggt gaaatcccgt ttcactaaaa atacaaaaat tacccaggca 6O0 
tggtggcggg cgcctgtaat cccaggtact cgggaggctg agggaggaga atcgcttgaa 660 
cctgggaggc tgagggagga gaatcgcttg aacccgggag gcagaggttg cagtgaaccg 720 
agatcatgtt gctgcactcc agcctggtca acagagcaag actctgcctc aaaaacaaac 780 
aaataaacaa acaaacaaac aaaacagaga gattttgct 819 

<210> 572 
<211> 203 
<212> DNA 
<213> Homo sapiens 

<400> 572 

tatagaatac tcaagctatg catcaagctt ggtaccgagc tcggatccac tatttacggc 60 
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20: 



coocagtgt'n ctqgaattcg cccttagctc ggatccacta gtccagtgtg gtggaattcc 120 
attatatrgg gcccaacaca atggagccac cacatccagc ctgccacata ctrttaaact 160 
ciucaggiiCLC atigagaacuc atg 

<2]0> 573 

,211> 132 

<-2l2> PRT 

.21 3> HoTio sapiens 

-■4GG> 57 3 

Val Glu Gly Glu Glv Glu Ala Arg His Val Leu His Gly Gly Arg 
5 ^ 10 15 

Arc Glu Arr:; Val Arg Gly Glu Thr Ala Thr Asn Phe Phe Phe Leu Arg 
20 25 30 

f'^.in Glu Ser Gly Pro Val Ala Gin Ala Gly Val Gin Trp His Asp Leu 
3^ 40 45 

Ser Ser Leu Jin Fro Leu Pro His Arg Phe Lys Gin Phe Ser Cys Leu 

50 55 60 

Ser Leu Pro His Ser Trp Asp His Arg Tyr Ala Pro Pro His Leu Ala 
65 7C 75 80 

Asn Phe Cys Ser Phe Ser Arg Asp Gly Val Ser Leu Cys Cys Ser Gly 
85 90 95 

Trp Ser Lys Thr Pro Gly Leu Gin Gin Ser Ala Cys Leu Gly Leu Pro 
100 105 110 

Lvs Cys Trn Gly Tyr Arg His Lys Pro Pro His Pro Ala Cys His lie 
115 120 125 

Leu Leu Asn Tyr 
130 



<210> 574 
<211> 62 
<212> PRT 

<213> Homo sapiens 
<400> 574 

Met Thr His Ser Ser Ala Trp Leu Glu Arg Pro Gin Glu Thr Tyr Asn 
5 10 15 

His Gly Gly Arg Arg Arg Gly Ser Lys Ala Arg Leu Thr Trp Trp Gin 
20 25 30 

Glu Arg Thr Ser Glu Gly Gly Asp Cys His Lys Leu Phe Phe Phe Glu 
35 40 45 

Thr Arg Val Trp Pro Cys Cys Pro Gly Trp Ser Ala Val Ala 
50 55 60 



<210> 575 
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<211> 76 

<212> PRT 

<213> Homo sapiens 

<400> 575 

Met Val Lys Ser Arg Phe Thr Lys Asn Thr Lys He Thr Gin Ala Trp 
^ 10 15 

Trp Arg Ala Pro Val He Pro Gly Thr Arg Glu Ala Glu Gly Gly Glu 
20 25 30 

Ser Leu Glu Pro Gly Arg Leu Arg Glu Glu Asn Arg Leu Asn Pro Gly 
35 40 45 

Gly Arg Gly Cys Ser Glu Pro Arg' Ser Cys Cys Cys Thr Pro Ala Trp 
50 55 60 

Ser Thr Glu Gin Asp Ser Ala Ser Lys Thr Asn Lys 
65 70 75 

<210> 576 

<211> 68 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> VARIANT 

<222> (1) . . . (68) 

<223> Xaa = Any Amino Acid 

<400> 576 

j^y^ oox ^rg va± cys Leu Pro Ser Thr Thr Val Thr 

5 10 15 

Thr Val Cys Tyr Leu Ala Ser Ser Ser Ala Ser Arg Glu Thr Ala Thr 
20 25 30 

Arg Gin Ala Pro Gly Asn Trp Lys Met Xaa Ser Lys Cys His Ala Gin 
35 40 45 

Leu Leu Phe Thr Phe Tyr Leu Asn His Phe Tyr Gin He Arg Leu Asn 
50 55 60 

Pro Gly Tyr Ser 
65 



<210> 577 

<211> 57 

<212> PRT 

<213> Homo sapiens 

<400> 577 

Met Tyr Leu Glu Asn Ser Phe Tyr Cys Gin Met He Leu Leu Lys Arg 
5 10 15 

Cys Arg Leu Ser Lys He Ser Thr Gin Arg Val Val Pro Asp Gly Pro 



BNSDOCID <WO 0151633A2 I > 



w o (M/5I633 



212 



PC T/l S01/(M574 



20 



30 



Pro Ala Pro Val Pro Gly Ser Phe Pro Met Phe Pro Arg Phe Gly Phe 
35 40 45 



Arg Leu Ala Pro Pro A.la Asp Thr Pro 
50 55 



<210> 578 
<211> 51 
<212> PRT 

<213> Homo sapiens 
<400> 578 

Met Gin Leu lie Tyr Leu Cys Phe Leu Gly Leu Leu Tyr lie Arg His 
5 10 15 

His Asp Ser Gin Ser Phe Val lie Leu Tyr Tyr Lys Lys Leu Asn Tyr 

20 25 30 

Tyr Phe Lvs Tyr Gly Gin He Arg Ala Phe His He Ala Lys Val Tyr 
35 4 0 4 5 



Gin Pro His 
50 



<210> 579 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 579 

Met His Phe Thr Phe Met Gin Leu He Tyr Leu Cys Phe Leu Gly Leu 
5 10 15 

Leu Tyr He Arg His His Asp Ser Gin Ser Phe Val He Leu Tyr Tyr 
20 25 30 

Lys Lys Leu Asn Tyr Tyr Phe Lys Tyr Gly Gin He Arg Ala Phe His 
35 40 45 

He Ala Lys Val Tyr Gin Pro His 
50 55 



<210> 580 
<211> 67 
<212> PRT 

<213> Homo sapiens 
<400> 580 

Met Glu Leu Arg Thr Lys Ala Leu Arg Thr Ala Gin Gin Leu Thr Ser 
5 ■ " 10 15 

Cys Val Thr Ala Leu Lys Ala Ala Gly Pro Pro Leu Thr Phe Trp Lys 
20 25 30 
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Gly Lys Trp Val Gin Cys Cys Leu Pro Leu Trp Gly Leu Leu Gly Ser 
35 40 45 

His Ala Phe Tyr lie Tyr Ala Val Asp lie Phe Met Phe Pro Gly Ser 

50 55 60 

Phe lie His 

65 



<210> 581 
<211> 77 
<212> PRT 

<213> Homo sapiens 
<400> 581 

Met Leu Glu Val Lys Phe Glu Val Ser Leu Arg Pro Thr Gly Asn Glu 
5 10 15 

Thr Ala Gly Gin Thr His Gly Thr Gin Asp Lys Gly Ser Lys Asp Ser 
-'^0 25 30 

Thr Ala Ala Asp He Leu Cys Asp Ser Leu Glu Ser Ser Arg Pro Ala 
35 40 45 

Ala His lie Leu Glu Gly Lys Met Gly Thr Met Leu Ser Ala Thr Leu 
50 55 60 

Gly Pro Ser Trp Val Thr Cys He Leu His Leu Cys Ser 
65 70 75 



<210> 582 
<211> 51 
<212> PRT 

<213> Homo sapiens 
<400> 582 

Met Leu Phe Leu Gin Thr He Asp Thr Lys Cys Thr Gly He Glu He 
5 10 15 

Asn Arg Asn Trp Ser Lys Val Trp His Thr His Ser His Val Asp Val 
20 25 30 

Lys Leu Cys Leu Glu Phe Leu Cys Gly Val Trp Phe Gly Leu Gly Phe 
35 40 45 

Leu Gly Val 
50 



<210> 583 
<211> 60 
<212> PRT 

<213> Homo sapiens 
<400> 583 
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Ser Thr Ser Asp Glv ?he Ala Pro Pro Pro Gin Leu Gly Ser Arg 
5 ^ 10 15 

"■ys Ser His lie Arg Gly Pro He Lys He Ala Arg Asn Lys Phe Pro 

20 ' 25 30 

Arg Thr Leu Thr Ser Gin Glu Leu Arg Arg Phe Ala Glu Tyr Ser Gly 
35 40 45 

Met Met Phe Gly Asp Gin Thr Thr Ala Gly Gin Lys 
50 55 60 



<210> 584 
<211> 76 
<212> PR? 

<213> fiomo sapiens 
<400> 584 

Met Cys Leu Cys He Pro Leu Gly Gly Tyr Gin Glu Leu Cys His Cys 
5 10 15 

r^et Ser Thr Ser Asp Gly Phe Ala Pro Pro Pro Gin Leu Gly Ser Arg 
20 ^ 25 30 

Cys Ser Kis He Arg Gly Pro He Lys Ho Ala Arg Asn Lys Phe Pro 
35 40 45 

Arg Thr Leu Thr Ser Gin Glu Leu Arg Arg Phe Ala Glu Tyr Ser Gly 
50 55 60 

Met Met Phe Gly Asp Gin Thr Thr Ala Gly Gin Lys 
65 70 75 



<210> 585 
<211> 50 
<212> PRT 

<213> Homo sapiens 
<400> 585 

Met Val Tyr Arg Phe Gly Gin Met Ser Asp Asn Pro Phe Tyr He Leu 
5 10 15 

Ala Ser Leu Gly Ser Ser Ser Cys Arg Asn Gly Leu Ala Ser Lys Trp 
20 25 30 

Arg Gin Ala Asp Pro Ser Asp Gly Tyr Met Glu Pro Cys Phe Gin Leu 
35 40 45 

Leu Phe 
50 



<210> 586 

<211> 60 

<212> PRT 

<213> Homo sapiens 
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<400> 586 

Met Leu Val His 



Gin Met Ser Asp 
20 

Ser Cys Arg Asn 
35 

Asp Gly Tyr Met 
50 



He Tyr Ser Cys 
5 

Asn Pro Phe Tyr 

Gly Leu Ala Ser 
40 

Glu Pro Cys Phe 
55 



Cys Gly Met Val 
10 

He Leu Ala Ser 
25 

Lys Trp Arg Gin 



Gin Leu Leu Phe 
60 



Tyr Arg Phe Gly 
15 

Leu Gly Ser Ser 
30 

Ala Asp Pro Ser 
45 



<210> 587 

<211> 1408 

<212> DNA 

<213> Homo sapiens 

<400> 587 

ctggacactt tgcgagggct tttgctggct gctgctgctg cccgtcatgc tactcatcgt 60 
agcccgcccg gtgaagctcg ctgctttccc tacctcctta agtgactgcc aaacgcccac 120 
cggctggaat tgctctggtt atgatgacag agaaaatgat ctcttcctct gtgacaccaa 180 
cacctgtaaa tttgatgggg aatgtttaag aattggagac actgtgactt gcgtctgtca 24 0 
gttcaagtgc aacaatgact atgtgcctgt gtgtggctcc aatggggaga gctaccagaa 300 
tgagtgttac ctgcgacagg ctgcatgcaa acagcagagt gagatacttg tggtgtcaga 360 
aggatcatgt gccacagatg caggatcagg atctggagat ggagtccatg aaggctctgg 420 
agaaactagt caaaaggaga catccacctg tgatatttgc cagtttggtg cagaatgtga 480 
cgaagatgcc gaggatgtct ggtgtgtgtg taatattgac tgttctcaaa ccaacttcaa 54 0 
tcccctctgc gcttctgatg ggaaatctta tgataatgca tgccaaatca aagaagcatc 600 
gtgtcagaaa caggagaaaa ttgaagtcat gtctttgggt cgatgtcaag ataacacaac 660 
tdcact^tdot adytctgaag atgggcarta rgcaagaaca gattatgcag agaatgctaa 720 
caaattagaa gaaagtgcca gagaacacca cataccttgt ccggaacatt acaatggctt 780 
ctgcatgcat gggaagtgtg agcattctat caatatgcag gagccatctt gcaggtgtga 840 
tgctggttat actggacaac actgtgaaaa aaaggactac agtgttctat acgttgttcc 900 
cggtcctgta cgatttcagt atgtcttaat cgcagctgtg attggaacaa ttcagattgc 960 
tgtcatctgt gtggtggtcc tctgcatcac aaggaaatgc cccagaagca acagaattca 1020 
cagacagaag caaaatacag ggcactacag ttcagacaat acaacaagag cgtccacgag 1080 
gttaatctaa agggagcatg tttcacagtg gctggactac cgagagcttg gactacacaa 1140 
tacagtatta tagacaaaag aataagacaa gagatctaca catgttgcct tgcatttgtg 1200 
gtaatctaca ccaatgaaaa catgtactac agctatattt gattatgtat ggatatattt 1260 
gaaatagtat acattgtctt gatgtttttt ctgtaatgta aataaactat ttatatcaca 1320 
caatawagtt ttttctttcc catgtatttg ttatatataa taaatactca gtgatgagaa 1380 
aaaaaaaaaa aaaaaaaaaa rwmgaccc 1408 

<210> 588 

<211> 81 

<212> PRT 

<213> Homo sapiens 

<400> 588 

Met Pro Gin Lys Gin Gin Asn Ser Gin Thr Glu Ala Lys Tyr Arg Ala 
5 10 15 

Leu Gin Phe Arg Gin Tyr Asn Lys Ser Val His Glu Val Asn Leu Lys 
20 25 30 
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:'iy Ala Cys Phe Thr Val Ala Gly Leu Pro Arg Ala Trp Thr Thr Gin 
35 40 45 

^■vr 5^er He He Asp Lys Arg He Arg Gin Glu He Tyr Thr Cys Cys 
50 ' 55 60 

v-i: Ala Phe Val Val He Tyr Thr Asn GJu Asn Met Tyr Tyr Ser Tyr 

70 75 8C 

J le 



-2j0> 589 
</lj> 157 
<212> PRT 

<213> Homo sapiens 
^;4 00> 58 9 

Met Thr Met Cys Leu Cys Val Ala Pro Met Gly Arg Ala Thr Arg Met 
5 10 15 

Sei Val Thr Cys Asp Arg Leu His Ala Asn Ser Arg Val Arg Tyr Leu 
20 25 30 

Trp Cys Gin Lys Asp His Val Pro Gin Met Gin Asp Gin Asp Leu Glu 
35 40 45 

Met Glu Ser Met Lys Ala Leu Glu Lys Leu Val Lys Arg Arg His Pro 
50 55 60 

Pro Val He Phe Ala Ser Leu Val Gin Asn Val Thr Lys Met Pro Arg 
65 70 75 80 

Met Ser Gly Val Cys Val He Leu Thr Val Leu Lys Pro Thr Ser He 
85 90 95 

Pro Ser Ala Leu Leu Met Gly Asn Leu Met He Met His Ala Lys Ser 
100 105 110 

Lys Lys His Arg Val Arg Asn Arg Arg Lys Leu Lys Ser Cys Leu Trp 
115 120 125 

Val Asp Val Lys He Thr Gin Leu Gin Leu Leu Ser Leu Lys Met Gly 
130 135 140 

He Met Gin Glu Gin He Met Gin Arg Met Leu Thr Asn 
145 150 155 



^.210> 590 
<211> 347 
■':H2> PRT 

^213> Homo sapiens 

<4 00> 5 90 

eu He Val Ala Arg Pro Val Lys Leu Ala Ala Phe Pro Thr 
5 10 15 



BNSDOC;[ ■ ..w: ■ u 'o:.' 'A.- 



wo *» 1/5 1 633 



PCT/llSO 1/(11574 



217 



Scr Leu Ser Asp 
20 

Asp Asp Axg Glu 

35 

Phe Asp Gly Glu 
50 

Gin Phe Lys Cys 
65 

Glu Ser Tyr Gin 



Cys Gin Thr Pro 



Asn Asp Leu Phe 

40 

Cys Leu Arg lie 
55 



Thr Gly 

25 

Leu Cys 

Gly Asp 



Asn Asn Asp Tyr Val Pro 
70 



Gin Ser Glu lie 
100 

Gly Ser Gly Ser 

115 

Gin Lys Glu Thr 
130 

Asp Glu Asp Ala 
145 

Gin Thr Asn Phe 



Asn Glu Cys Tyr 
85 

Leu Val Val Ser 



Leu Arg 
90 

Glu Gly 
105 



Gly Asp Gly Val His Glu 
120 



Ser Thr Cys Asp 
135 

Glu Asp Val Trp 
150 

Asn Pro Leu Cys 
165 



Asn Ala Cys Gin He Lys Glu Ala 



Glu Val Met Ser 
195 

Lys Ser Glu Asp 
210 

Asn Lys Leu Glu 

225 

His Tyr Asn Gly 



Leu Gly Arg Cys 
200 

Gly His Tyr Ala 
215 



He Cys 

Cys Val 

Ala Ser 
170 

Ser Cys 
165 

Gin Asp 
Arg Thr 



Glu Ser Ala Arg Glu His 
230 



Met Gin Glu Pro 
260 

Cys Glu Lys Lys 
275 

Arg Phe Gin Tyr 
290 

Ala Val He Cys 
305 



Phe Cys Met His 
245 

Ser Cys Arg Cys 



Gly Lys 
250 

Asp Ala 
265 



Asp Tyr Ser Val Leu Tyr 
280 

Val Leu He Ala Ala Val 
295 



Trp Asn Cys Ser Gly Tyr 
30 

Asp Thr Asn Thr Cys Lys 

45 

Thr Val Thr Cys Val Cys 
60 

Val Cys Gly Ser Asn Gly 
75 80 

Gin Ala Ala Cys Lys Gin 
95 

Ser Cys Ala Thr Asp Ala 
110 

Gly Ser Gly Glu Thr Ser 
125 

Gin Phe Gly Ala Glu Cys 
140 

Cys Asn He Asp Cys Ser 
155 160 

Asp Gly Lys Ser Tyr Asp 
175 

Gin Lys Gin Glu Lys He 

190 

Asn Thr Thr Thr Thr Thr 

205 

Asp Tyr Ala Glu Asn Ala 
220 

His He Pro Cys Pro Glu 
235 240 

Cys Glu His Ser He Asn 
255 

Gly Tyr Thr Gly Gin His 
270 

Val Val Pro Gly Pro Val 

285 

He Gly Thr He Gin He 
300 



Val Val Val Leu 
310 



Cys He Thr Arg Lys Cys Pro Arg 
315 ' 320 
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Soi Asn Arg lie His Arg Gin Lys Gin Asn Thr Giy His Tyr Ser Ser 

325 ' 330 335 

A,^-^. Asn Thr Thr Arg Ala Ser Thr Arg Leu lie 

340 345 



<21C> 591 
<33 1> 565 
-.212> DNA 
-Ji3> Homo sapien 



<4C0> 591 

actaaagcaa 

ct tcatgcct 

aaacagacaa 

caggaagcaa 

atgcacgtac 

catttcatat 

tlatcttgtt 

tactgtagta 

attctctata 

caattggtt a 



atgaacaagc 
tgactcatgt 
aaaatattgt 
gagttaatcc 
cttcatttgg 
gggcaacctg 
ttcctgt cat 
aagcatttca 
attcccaagg 
aacaaaaaaa 



tgacttgcta 
aaatgcaata 
acaacattgc 
cagaggtcta 
aaaattgtca 
ccagacagga 
gggagttcca 
aaaatt ctt a 
acaggccata 
aaaaa 



gtatcatctg 
ggattaaaaa 
acccagtgtc 
tgtcctaatg 
tttgtccatg 
gaaagtactt 
gaaaaagtta 
aatcagtgga 
attgaaggaa 



catt cattga 


agcacaagaa 


60 


ataaatttga 


tatcacatgg 


120 


agattctaca 


cctggccact 


180 


tgttatggca 


aatggatgtc 


240 


tgacagttga 


tacttattca 


300 


cccatgttaa 


aagacattta 


360 


aaacagacaa 


tgggccaggt 


420 


aaattacaca 


tacaatagga 


480 


ctaatagaac 


actcaaagct 


540 






565 



<210> 592 
<211> 188 
<212> PRT 
<213> Homo sapien 

<400> 592 

Thr Lys Ala Asn Glu Gin Ala Asp Leu Leu Val Ser Ser Ala Phe lie 

1 5 10 15 

tr^.lu Ala Gin Glu Leu His Ala Leu Thr His Val Asn Ala lie Gly Leu 

20 25 30 

Lys Asn Lys Phe Asp lie Thr Trp Lys Gin Thr Lys Asn lie Val Gin 

35 40 45 

His Cys Thr Gin Cys Gin lie Leu His Leu Ala Thr Gin Glu Ala Arg 

50 55 60 

Val Asn Pro Arg Gly Leu Cys Pro Asn Val Leu Trp Gin Met Asp Val 
65 70 75 80 

Met His Val Pro Ser Phe Gly Lys Leu Ser Phe Val His Val Thr Val 

85 90 95 

Asp Thr Tyr Ser His Phe lie Trp Ala Thr Cys Gin Thr Gly Glu Ser 

100 105 110 

Thr Ser His Val Lys Arg His Leu Leu Ser Cys Phe Pro Val Met Gly 

115 120 125 

Val Pro Glu Lys Val Lys Thr Asp Asn Gly Pro Gly Tyr Cys Ser Lys 

130 135 140 

Ala Phe Gin Lys Phe Leu Asn Gin Trp Lys lie Thr His Thr lie Gly 
145 150 155 160 

lie Leu Tyr Asn Ser Gin Gly Gin Ala lie lie Glu Gly Thr Asn Arg 

165 170 175 

Thr Leu Lys Ala Gin Leu Val Lys Gin Lys Lys Lys 
180 185 

<210> 593 
<2:il> 271 
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<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 

<222> (1) . . . (271) 
<223> n = A, T, C or G 



<400> 593 

actttatgtt cnagtgcana aanccncctg gattgccacc ntactctcag ggctgtgant 60 

tgtgcnccca nagcaacctg ggcacgcggg gacagggggg ccnacaattg agggagcggt 120 

gtccctagct ggggtctata catgncnggg naagggcngc tgagtnccat nagcaaagga 180 

nctagnatnt gcgggggtgc ggcctgggcc taccctttna agcatccntn gatccactcc 240 

angaanccng gggtagncag gtttnccaac a 271 



<210> 594 
<211> 376 
<212> DNA 
<213> Homo sapien 



<220> 

<221> Tnisc_f eature 
<222> (1) , . . (376) 
<223> n = A,T,C or G 



<400> 594 

cctttggggg 

gcgccctcnn 

cgattaagcg 

cgattcgacg 

cccacgangg 

sgggggntgc 



nggggggaac 
gggccaacaa 
ncaaatgtgt 
acaaggcgtn 
nttcgtgtcg 
cgcccttagg 



ctttaccatt 
agttatcgtn 
agcaaaangc 
gcgcgntanc 
tcacatggnc 
gaggnagggg 



gtnccccttt 
nttgaagaga 
cgtgccactt 
gttagtctcn 
tctagacata 
tggggacact 



Gca^i^gaaga aaaggn 

<210> 595 
<211> 242 
<212> DNA 
<213> Homo sapien 



atttcatttg 
anattttttt 
gtggcgtagc 
aatngacccn 
acgcncnccn 
agccaancca 



gttngggttc 
ggnttngncc 
tncgtcgggt 
gtggcatgag 
ttttttncag 
nantctnacc 



60 
120 
180 
240 
300 
360 
376 



<220> 

<221> misc_feature 
<222> (1)...(242) 
<223> n = A, T,C or G 



catnntgagg acaanagtng cactgaggct 60 
aaaaagcatc cacccacctc tgnaangggt 120 
angagncccn ggcatgntac atcttcttcc 180 
ttttcttttt ctcttaaggg ncncntggct 240 

242 

<210> 596 
<211> 535 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc feature 



<400> 595 
agnctgctgn 
tgngnatgcc 
atgccangag 
acccctnaaa 
to 



tcgtnccctn 
aggcaaggnc 
cangtgcacc 
ntttgngcta 



tatgtggctt 
aagctggctc 
agtcccaact 
caangnccat 
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^2-2> (1) . . . (535) 
<223> n = A, ?, C or G 



<400> 596 

dccagttgga tactgctaaa nagatattta rgcagcctca tatgttaagt cgtatatrtt 60 

gaaagctttt taaatttttt ctttaagaag attttagatg cttatcactg agtaccagag 120 

oqatgtaggc tgatgccctr atcaacaaag tcagggactg tqgcacacaa ggattgacta 180 

ctgcagacac ggccacaatg ctacctctag agggcctgaa tccccctgcc ctctctggtg 240 

qggagaaggg ctggcagagc cattagcatg ggctccggcc aatcctggcc actttgacac 300 

rcctggtgct gacccagggt cctggaggaa gggatgaggt gggcagtaga gatgctcagg 360 

gcagtggccc ctttccatcc acactggaac tatttcagta ttttaccacc aattcagcca 420 

ttcccttgtg cgctggctga acatcagccc tgctccaggt ctcagtttcc cctttgtaaa 480 

gogaaagctc tggattcagg gagtgatgaa gaggtcatca tggtcttgag aattc 535 



<210> 597 
<211> 257 
<212> DNA 
'-.213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (257) 
<223> n = A, T, C or G 



<400> 597 

tttcnatacc caaaantacc ccatattang accanacatt tgtctnggaa aaattaccat 60 

tntntaacnt ttgggccacc tgagannaaa tgggtgtaat ncatgataag atggancagn 120 

aLtnctctta agatnngatn agaccccgtt tttcacggaa catatccaag nacccaatag 180 

gnaacaagcc acgggnggag tcacaaacat atattcttta ctctcataat ccgtnncaca 240 

naactnttgn acttgac 257 



-.210> 598 

-.2il> 222 

<;:12> DNA 

<213> Homo sapien 

<220> 

':221> mis cofeature 
<222> (1) . . . (222) 
<223> n = A, T, C or G 



<400> 598 

nntggntacc gtcnaaactt nncttggtac ccgagctcgg atccactagt ccagtgtggt 60 

qqaattccat tgtgttgqgc tataagctgt aatagtggag ncgtgctngg ttcattgcan 120 

nagnccctcc gcanncacnc ttgnnacaac ctgtgagnag gcnataaatt attcacataa 180 

tcatcactgc atgaanctga ctcaaacgca tccacntaca cc 222 



<210> 599 
<211> 238 
<212> DNA 
<213> Homo sapien 

>20> 

<221> misc_feature 

<222> (1) . . . (238) 

' 223> n = A, T, C or G 

<400> 599 
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gcatgacatc ancgatgtnt ttggnnacct ganattngct aaaactngng natgccgggn 60 

atgnaggttt ggtantgatc tatgcactca catctcatgg ggacgtttca tgtggagtgn 120 

tcgacaangt tgctgnancn gagaagtgat gatctcagtt gaaagggtca tgtgaataca 180 

cnttacactt gaaaaagaag cacattggga atatcacgaa acgnccacca acatcctg 238 



<210> 600 
<211> 232 
<212> DNA 
<213> Homo sapien 

<220> 

<221> inisc__feature 
<222> (1)...{232) 
<223> n = A,T,C or G 



<400> 600 

cgaactattt agactaccta ggaaaattat tttagtatca gaagaatatc aggggtgtag 60 

tactcatcag agctaaatga gagcgcttta aaaatgttag tttgtcttcc gccatttcta 120 

cagaaagctg caatttcagg ttttcaacct aataggtgat atttaanaaa aaaaaaaagc 180 

aatcgcaaat agccccactg cttttacaaa tcattttttc cccaacacaa tg 232 



<210> 601 
<211> 547 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (547) 
<223> n = A,T^C or G 



<400> 601 

cattgtgttg gggaaaaaat gatttgtata agcagLgggg ctatttgcga ttgctttttt 60 

tttttcttaa atatcaccta ttaggttgaa aacctgaaat tgcagctttc tgtagaaatg 120 

gcggaagaca aactaacatt tttaaagcgc tctcatttag ctctgatgag tactacaccc 180 

ctnatattct tctgatacta aaataatttt cctagtgtag tctaaacttt tttaaaaaga 240 

catgtaatcc gcggagttag taactcaaaa cgagtgcatc tnggaagtat cgcagccgtt 300 

nctggatnaa attcccagct tgctngcttg ctnagccggg gggcggtnaa aaaaacatct 360 

gcagcccngg ggnaaaaacc ttcgcattgt tcttacgtgt ttacgttatt ttatttccct 420 

nnagcaaggc nggganttgg ggactcgaaa tggtacagtt gggctgggga tcgcccttgt 480 

tacataaaag ncgtccagaa gagggacggt tacaggcngg ganctccaaa ggtcagtccc 54 0 

tgccatt 547 



<210> 602 
<211> 826 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (826) 
<223> n = A, C or G 



<400> 602 

cggggggnnt tacgtctctc tggacgcttt tattgtacca gggcgatccc agcccaactg 60 

taccattcga gtccctactc ctgccttgct ctagggaaat aaaataacgt aaacacgtaa 120 

gaacaatgcg aaagcgtttt cttccctagg ctgcagattg tcttcttcac cgcccctgct 180 

tagctagcta gctagctggg aatttaatcc agaaacggct tgcgatacct cctagatgca 240 
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ctcgttttga gttacaaact ccgcggatta catgtctttt taaaaaagtt tagactacac 3C0 

tagggaaaat tarrttagta tcagaagaat atcagqgggt gtagtactca tcagagctna 360 

atgagagcgc ttiaaaaatg ttagtttgtc ttccgccatt tctacagaaa gctgcaattt 420 

caggttttca ncctaatagg tgatatntaa gaaaaaaaaa acaatcgcan atagcccacr 480 

gcttttacaa atcatttttc tcttctaggt atagcctgtc aggtggccta atgtattttt 540 

gacatctcta ggaattttaa tagaccagaa atgggtgcca gagatatgcc tgcactaatc 600 

ttaagtgggg atttatgtat ttctcaanca agtgattaaa gcaaaactag gcacgaatga 660 

aatcaagatc tttaggccag aaatcatgaa nanttttana attattttan gaatctgtgg 720 

c^ttctcttct taaaatngaa aaaaaaattg tttaaaccca naaggtctga atacccaagc 780 

nccctgaacn anagaacaan gccggagcac cccctcccaa atcccc 826 



<21C> 603 
<211> 817 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> {!)... (817) 
<223> n - A, T, C or G 



<400> 603 

nnangacttt tgtggtntta tacaattntt ttttctattt ctatgaagag aaagccacag 60 

agtcctaaaa taattctaaa actcatcatg actttcttgc ctaaaagatc ttgatttcaa 120 

tcgtgcctag ttttgcttta atcacttgct tgagaaatac ataaatcccc acttaagatt 180 

agtgcaggca tatctctggc acccatttct ggttctatta aaattcctag agatgtcaaa 240 

aattacatta gcccacctga caggctatac ctagaagaga aaaaatgatt tgtaaaagca 300 

gtggggctat ttgcgattgc tttttttttt tcttaaatat cacctattag gttgaaaacc 360 

tgaaattgca gctttctgta gaaatggcgg aagacaaact aacattttta aagcgctctc 420 

atttagctct gatgagtact acacccctga tattcttctg atactaaaat aattttccta 480 

gtgtagtcta aactttttta aaaagacatg taatccgcgg agtttgtaac tcaaaacgag 540 

tqcatctagg aggtatcgca agccgtttct ggattaaatt cccagctagc ttgcttgctt 600 

'igcaggggcg ggnaaanaag acatctgcag cctagggaag aaaacctttc gcattgttct 660 

tacgtgttta cgttatttta tttcctanaa caaggcngaa ttgggactcg aatggttcag 720 

ttggggtggg ggatcccctg gtncataaaa ngtcanaaag anggtacagg cggaacncca 780 

agggtcgtcc tgcatttana ctcggaattt tggtgcc 817 



<210> 604 
<211> 694 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (694 ) 
<223> n = A,T,C or G 



<400> 604 

cttttcaaat catttttnct cttctaggta tancctgtca ggtggcctaa tgtaattttt 60 

gacatctcta ngaattttaa tagaaccaga aatgggtgcc agagatatgc ctgcactaat 120 

cttaagtggg gatttatgta tttctcaagc aagtgattaa agcaaaacta ggcacgattg 180 

aaatcaagat cttttaggca anaaagtcat gatgagtttt agaattattt taggactctg 240 

Iggctttctc ttcatagaaa tagaaaaaaa aattgtataa aaccacaaaa ggtcctgaat 300 

agccaaagca acactganca aaaagaacan agcagggaag caacacacta ccngaattca 360 

aattatacta ccagggtgta gtaaccaaaa cagcattcta ttggcataaa atagacacca 420 

agaccaatgg ancagaataa agaaccccac aaataaatcc atatatntac cgccanctga 480 

ttatcaataa cnaacaccaa gaacatatnt taagggacnt nctattcaat aantagtgct 540 

ggnaaaaact gggaaatcca tatgcagaaa naatgaaact agacccctat ccctcaccat 600 
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acgcaaannt caacttcgga atgggattac aaaacttaag acattccaac ccaagaaact 
atnaaancta ctattaagaa aacagatcnc nccc 

<210> 605 
<211> 678 
<212> DNA 
<213> Homo sapien 



660 
694 



<220> 

<221> misc_feature 
<222> (1) . . . (678) 
<223> n = A,T,C or G 



<400> 605 

taaaaatcta 

actcatcana 

agaaagctgc 

atcgcaaata 

ggtggcctaa 

agagatatgc 

agcaaaacta 

anaattattt 

aaaaccacaa 

agcaacacac 

attgggcata 

cctatattta 



gactacacta 
gctaaatgag 
aatttcaggt 
gccccactgc 
tgtaattttt 
ctgcactaat 
ggcacgattg 
taggactctg 
aaggtcctga 
taccggaatt 
aaatagacca 
cngcccnc 



ggaaattatt 
agcgctttaa 
tttcaaccta 
ttttacaaat 
gacatctcta 
cttaagtggg 
aaatcaanat 
tggctttctc 
atagcccaaa 
caattatact 
aagaccagtg 



ttantatcag 
aaatgttagt 
ataggtgata 
cattttttct 
ggaattttaa 
gatttatgta 
cttttaggca 
ttcatagaaa 
gcaacactga 
accaaggtgt 
ggaaacagaa 



aagaatatca 
ttgtcttccg 
tttaagaaaa 
cttctaggta 
tagaaccaga 
tttctcaagc 
agaaagtcat 
tagaaaaaaa 
acaaaangaa 
antaaccaaa 
taaagaancc 



ggggtgtagt 
ccatttctac 
aaaaaaagca 
tagcctgtca 
aatgggtgcc 
aagtgattaa 
gatgagtttt 
aaattgtata 
caaagcagga 
acagcattct 
caaaataaat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
678 



<210> 606 
<211> 263 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . , (263) 
<223> n A, T,C or G 



<400> 606 

gtggggtcng cancagccaa ctcagcttcc tttcgggctt tgttagcaga cggatcatcc 

tctagtccac tgtgntcaaa ttccattgtg tgggggccnc tcgcctcggc canagatctg 

agtgancana cntgtcccca ctgaggtgcc ccacagcngn ttgtnttcag cangggctna 

caactcgacc ggcagcgnan ggctggcaga antgngcgcc tnnctcattc ctacgcngtn 

ngccgcagga aggangacag gcc 

<210> 607 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



60 
120 
180 
240 
263 



<400> 607 

ccatgtgggt cccggttgtc tt 

<210> 608 
<211> 22 
<212> DNA 



22 
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Artif ici al 



Sequence 



Pri.TT.er 



<.4i)Cy 608 

uat aqgggtg ctcaggqatt gg 



22 



<210> 60 9 
<Xil> 4 0 
.2:2> DNA 

<213> Artificial Sequence 

<22C> 

<223> Primer 
<400> 609 

gctggacagg rrggcaaaagc tggggcagtg aaccatgtgc 40 

<210> 610 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 610 

ccttgtccag atagcccagt agctgac 27 

<210> 611 
<211> 4 6 
:2T2> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 611 

gatagagaaa accgtccagg ccagtattgt gggaggctgg gagtgc 46 

<210> 612 
<211> 40 
<212> DNA 

^:213> Artificial Sequence 
<220> 

<223> Primer 
<400> 612 

gcacatggtt cactgcccca gcttttgccc cctgtccagc 40 

<210> 613 
<211> 38 
-'212> DNA 

-\:]'}3> Artificial Sequence 



<220> 
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<223> Primer 
<400> 613 

gccgctcgag ttagaattcg gggttggcca cgatggtg 

<210> 614 
<211> 53 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 614 

cggcgggcat atgcatcacc atcaccatca catcataaac ggcgaggact gca 

<210> 615 
<211> 46 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



38 



53 



<400> 615 

gcactcccag cctcccacaa tactggcctg gacggttttc tctatc 

<210> 616 

<211> 1350 

<212> DNA 

<213> Homo sapien 



46 



<400> 616 

atgcatcacc 

tggcaggcgg 

cagtgggtgc 

cacagtcttg 

cggcacccag 

gaatccgtgt 

gcggggaact 

gtgctgcagt 

ccgctgtacc 

aacggtgact 

ggaaaagccc 

actgagtgga 

cattcccaac 

ctggtgcacc 

ttgctgggtc 

cacagcttcc 

ggtgatgact 

gatgctgtga 

gcctcaggct 

gtggacctcc 

aagttcatgc 

gccctgcccg 

gacaccatcg 



atcaccatca 
cactggtcat 
tgtcagccgc 
aggccgacca 
agtacaacag 
ccgagtctga 
cttgcctcgt 
gcgtgaacgt 
accccagcat 
ctggggggcc 
cgtgtggcca 
tagagaaaac 
cctggcaggt 
cccagtgggt 
ggcacagcct 
cacacccgct 
ccagccacga 
aggtcatgga 
ggggcagcat 
atgttatttc 
tgtgtgctgg 
aaaggccttc 
tggccaaccc 



catcataaac 
ggaaaacgaa 
acactgtttc 
agagccaggg 
acccttgctc 
caccatccgg 
ttctggctgg 
gtcggtggtg 
gttctgcgcc 
cctgatctgc 
agttggcgtg 
cgtccaggcc 
gcttgtggcc 
cctcacagct 
gtttcatcct 
ctacgatatg 
cctcatgctg 
cctgcccacc 
tgaaccagag 
caatgacgtg 
acgctggaca 
cctgtacacc 
cgaattctaa 



ggcgaggact 
ttgttctgct 
cagaactcct 
agccagatgg 
gctaacgacc 
agcatcagca 
ggtctgctgg 
tctgaggagg 
ggcggagggc 
aacgggtact 
ccaggtgtct 
agtattgtgg 
tctcgtggca 
gcccactgca 
gaagacacag 
agcctcctga 
ctccgcctgt 
caggagccag 
gagttcttga 
tgtgcgcaag 
gggggcaaaa 
aaggtggtgc 



gcagcccgca 
cgggcgtcct 
acaccatcgg 
tggaggccag 
tcatgctcat 
ttgcttcgca 
cgaacggcag 
tctgcagtaa 
aagaccagaa 
tgcagggcct 
acaccaacct 
gaggctggga 
gggcagtctg 
tcaggaacaa 
gccaggtatt 
agaatcgatt 
cagagcctgc 
cactggggac 
ccccaaagaa 
ttcaccctca 
gctggggcag 
attaccggaa 



ctcgcagccc 
ggtgcatccg 
gctgggcctg 
cctctccgta 
caagttggac 
gtgccctacc 
aatgcctacc 
gctctatgac 
ggactcctgc 
tgtgtctttc 
ctgcaaattc 
gtgcgagaag 
cggcggtgtt 
aagcgtgatc 
tcaggtcagc 
cctcaggcca 
cgagctcacg 
cacctgctac 
acttcagtgt 
gaaggtgacc 
tgaaccatgt 
gtggatcaag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1350 



<210> 617 
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• ::l l> 44 9 

<^12> PRT 

' 21 3"' Homo sapien 



<rAOO> €11 



Kei. 


His 


His 


His 


His 


His 


His 


He 


He 


Asn 


Gly 


Glu 


Asp 


Cys 


Ser 


Pro 


1 








5 










1 0 










1 5 




H 


Sor 


Gin 


Pro 


Trp 


Gin 


Ala 


Ala 


Leu 


Val 


Met 


Glu 


Asn 


Glu 


Leu 


Phe 








20 










25 
















Cys 


Ser 


Gly 


Val 


Leu 


Val 


His 


Pro 


Gin 


Trp 


Val 


Leu 


Ser 


A.la 


Ala 


His 




35 










40 










45 








Cys 


Phe 


Gin 


Asn 


Ser 


Tyr 


Thr 


He 


Gly 


Leu 


Gly 


Leu 


His 


Ser 


Leu 


Glu 




50 










55 










60 










Ala 


Asp 


Gin 


Glu 


Pro 


Gly 


Ser 


Gin 


Met 


Val 


Glu 


Ala 


Ser 


Leu 


Ser 


Val 


65 










70 










75 










80 


Arg 


His 


Pro 


Glu 


Tyr 


Asn 


Arg 


Pro 


Leu 


Leu 


Ala 


Asn 


Asp 


Leu 


Met 


Leu 










85 










90 










95 




lie 


Lys 


L^eu 


Asp 


Glu 


Ser 


Val 


Ser 


Glu 


Ser 


Asp 


Thr 


He 


Arg 


Ser 


He 








100 










105 










110 






Ser 


lie 


Ala 


Ser 


Gin 


Cys 


Pro 


Thr 


Ala 


Gly 


Asn 


Ser 


Cys 


Leu 


Val 


Ser 






115 










120 










125 








Gly 


Trp 


Gly 


Leu 


Leu 


Ala 


Asn 


Gly 


Arg 


Met 


Pro 


Thr 


Val 


Leu 


Gin 


Cys 




130 










135 










140 










Va 1 


Asn 


Val 


Ser 


Val 


Val 


Ser 


Glu 


Glu 


Val 


Cys 


Ser 


Lys 


Leu 


Tyr 


Asp 


145 










150 










155 










160 


Pro 


Leu 


Tyr 


His 


Pro 


Ser 


Met 


Phe 


Cys 


Ala 


Gly 


Gly 


Gly 


Gin 


Asp 


Gin 










165 










170 










175 




Lys 


Asp 


Ser 


Cys 


Asn 


Gly Asp 


Ser 


Gly 


Gly 


Pro 


Leu 


He 


Cys 


Asn 


Gly 








180 










185 










190 






Tyr 


Leu 


Gin 


Gly 


Leu 


Val 


Ser 


Phe 


Gly 


Lys 


Ala 


Pro 


Cys 


Gly 


Gin 


Val 






195 










200 










205 








Gly 


Val 


Pro 


Gly 


Val 


Tyr 


Thr 


Asn 


Leu 


Cys 


Lys 


Phe 


Thr 


Glu 


Trp 


He 




210 










215 










220 










Glu 


Lys 


Thr 


Val 


Gin 


Ala 


Ser 


He 


Val 


Gly 


Gly 


Trp 


Glu 


Cys 


Glu 


Lys 


225 










230 










235 










240 


His 


Ser 


Gin 


Pro 


Trp 


Gin 


Val 


Leu 


Val 


Ala 


Ser 


Arg 


Gly 


Arg 


Ala 


Val 










245 










250 










255 




Cys 


Gly 


Gly 


Val 


Leu 


Val 


His 


Pro 


Gin 


Trp 


Val 


Leu 


Thr 


Ala 


Ala 


His 








260 










265 










270 






Cys 


He 


Arg 


Asn 


Lys 


Ser 


Val 


He 


Leu 


Leu 


Gly 


Arg 


His 


Ser 


Leu 


Phe 






275 










280 










285 








His 


Pro 


Glu 


Asp 


Thr 


Gly 


Gin 


Val 


Phe 


Gin 


Val 


Ser 


His 


Ser 


Phe 


Pro 




290 










295 










300 










His 


Pro 


Leu 


Tyr 


Asp 


Met 


Ser 


Leu 


Leu 


Lys 


Asn 


Arg 


Phe 


Leu 


Arg 


Pro 


305 










310 










315 










O O A 


Gly 


Asp 


Asp 


Ser 


Ser 


His 


Asp 


Leu 


Met 


Leu 


Leu 


Arg 


Leu 


Ser 


Glu 


Pro 










325 










330 










335 




Ala 


Glu 


Leu 


Thr 


Asp 


Ala 


Val 


Lys 


Val 


Met 


Asp 


Leu 


Pro 


Thr 


Gin 


Glu 








340 










345 










350 






Pro 


Ala 


Xjeu 


Gly 


Thr 


Thr 


Cys 


Tyr 


Ala 


Ser 


Gly 


Trp 


Gly 


Ser 


He 


Glu 






355 










360 










365 








Pro 


Glu 


Glu 


Phe 


Leu 


Thr 


Pro 


Lys 


Lys 


Leu 


Gin 


Cys 


Val 


Asp 


Leu 


His 




370 










37 5 










380 










Val 


He 


Ser 


Asn 


Asp 


Val 


Cys 


Ala 


Gin 


Val 


His 


Pro 


Gin 


Lys 


Val 


Thr 


385 










390 










395 










400 


Lys 


Phe 


Met 


Leu 


Cys 


Ala 


Gly 


Arg 


Trp 


Thr 


Gly 


Gly 


Lys 


Ser 


Trp 


Gly 










405 










410 










415 




Ser 


Glu 


Pro 


Cys 


Ala 


Leu 


Pro 


Glu 


Arg 


Pro 


Ser 


Leu 


Tyr 


Thr 


Lys 


Val 
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420 425 430 

Val His Tyr Arg Lys Trp He Lys Asp Thr He Val Ala Asn Pro Glu 
435 440 445 

Phe 



<210> 618 

<211> 3923 

<212> DNA 

<213> Homo sapien 

<400> 618 

acagaagaaa tagcaagtgc cgagaagctg gcatcagaaa aacagagggg agatttgtgt 60 

ggctgcagcc gagggagacc aggaagatct gcatggtggg aaggacctga tgatacagag 120 

gaattacaac acatatactt agtgtttcaa tgaacaccaa gataaataag tgaagagcta 180 

gtccgctgtg agtctcctca gtgacacagg gctggatcac catcgacggc actttctgag 240 

tactcagtgc agcaaagaaa gactacagac atctcaatgg caggggtgag aaataagaaa 300 

ggctgctgac tttaccatct gaggccacac atctgctgaa atggagataa ttaacatcac 360 

tagaaacagc aagatgacaa tataatgtct aagtagtgac atgtttttgc acatttccag 420 

cccctttaaa tatccacaca cacaggaagc acaaaaggaa gcacagagat ccctgggaga 480 

aatgcccggc cgccatcttg ggtcatcgat gagcctcgcc ctgtgcctgg tcccgcttgt 540 

gsgggaagga cattagaaaa tgaattgatg tgttccttaa aggatgggca ggaaaacaga 600 

tcctgttgtg gatatttatt tgaacgggat tacagatttg aaatgaagtc acaaagtgag 660 

cattaccaat gagaggaaaa cagacgagaa aatcttgatg gcttcacaag acatgcaaca 720 

aacaaaatgg aatactgtga tgacatgagg cagccaagct ggggaggaga taaccacggg 780 

gcagagggtc aggattctgg ccctgctgcc taaactgtgc gttcataacc aaatcatttc 840 

atatttctaa ccctcaaaac aaagctgttg taatatctga tctctacggt tccttctggg 900 

cccaacattc tccatatatc cagccacact catttttaat atttagttcc cagatctgta 960 

ctgtgacctt tctacactgt agaataacat tactcatttt gttcaaagac ccttcgtgtt 1020 

gctgcctaat atgtagctga ctgtttttcc taaggagtgt tctggcccag gggatctgtg 1080 

aacaggctgg gaagcatctc aagatctttc cagggttata cttactagca cacagcatga 1140 

tcattBcgga gtigaattatc taatcaacat catcctcagt gtctttgccc atactgaaat 1200 

tcatttccca cttttgtgcc cattctcaag acctcaaaat gtcattccat taatatcaca 1260 

ggattaactt ttttttttaa cctggaagaa ttcaatgtta catgcagcta tgggaattta 1320 

attacatatt ttgttttcca gtgcaaagat gactaagtcc tttatccctc ccctttgttt 1380 

gatttttttt ccagtataaa gttaaaatgc ttagccttgt actgaggctg tatacagcac 1440 

agcctctccc catccctcca gccttatctg tcatcaccat caacccctcc cataccacct 1500 

aaacaaaatc taacttgtaa ttccttgaac atgtcaggac atacattatt ccttctgcct 1560 

gagaagctct tccttgtctc ttaaatctag aatgatgtaa agttttgaat aagttgacta 1620 

tcttacttca tgcaaagaag ggacacatat gagattcatc atcacatgag acagcaaata 1680 

ctaaaagtgt aatttgatta taagagttta gataaatata tgaaatgcaa gagccacaga 1740 

gggaatgttt atggggcacg tttgtaagcc tgggatgtga agcaaaggca gggaacctca 1800 

tagtatctta tataatatac ttcatttctc tatctctatc acaatatcca acaagctttt 1860 

cacagaattc atgcagtgca aatccccaaa ggtaaccttt atccatttca tggtgagtgc 1920 

gctttagaat tttggcaaat catactggtc acttatctca actttgagat gtgtttgtcc 1980 

ttgtagttaa ttgaaagaaa tagggcactc ttgtgagcca ctttagggtt cactcctggc 2040 

aataaagaat ttacaaagag ctactcagga ccagttgtta agagctctgt gtgtgtgtgt 2100 

gtgtgtgtgt gagtgtacat gccaaagtgt gcctctctct cttgacccat tatttcagac 2160 

ttaaaacaag catgttttca aatggcacta tgagctgcca atgatgtatc accaccatat 2220 

ctcattattc tccagtaaat gtgataataa tgtcatctgt taacataaaa aaagtttgac 2280 

ttcacaaaag cagctggaaa tggacaacca caatatgcat aaatctaact cctaccatca 2340 

gctacacact gcttgacata tattgttaga agcacctcgc atttgtgggt tctcttaagc 2400 

aaaatacttg cattaggtct cagctggggc tgtgcatcag gcggtttgag aaatattcaa 2460 

ttctcagcag aagccagaat ttgaattccc tcatctttta ggaatcattt accaggtttg 2520 

gagaggattc agacagctca ggtgctttca ctaatgtctc tgaacttctg tccctctttg 2580 

tgttcatgga tagtccaata aataatgtta tctttgaact gatgctcata ggagagaata 2640 

taagaactct gagtgatatc aacattaggg attcaaagaa atattagatt taagctcaca 2700 
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ctcqtcaaaa qqaaccaaga tacaaagaac tctgagctgt catcgtcccc atctctgtga 2760 

q'^-'C^acaacca acagcaggac ccaacgcatg tctgagatcc ttaaarcaag gaaaccagrg 282C 

.. ■i^dtgagttg aatrctccta ttatggatgc tagcttctgg ccatc~ctgg ctctcctctt 2880 

jacacatatt agcttctagc ctttgcttcc acgactttta tcttttctcc aacacatcgc 2940 

ttaccaatcc tcrctctgct ctgttgcttt ggacttcccc acaagaattt caacgactct 3000 

caagtctttt cttccatccc caccactaac ctgaatgcct agacccttat ttttattaat 3060 

ttccaataga tgctgcctat gggctatatt gctttagatg aacattagat atttaaagct 3120 

caagaggttc aaaatccaac tcattatcrt ctctttcttt cacctccctg ctcctctccc 3180 

tatattactg attgcactga acagcatggt ccccaatgta gccatgcaaa tgagaaaccc 3240 

agtggctcct tgtggtacat gcatgcaaga ctgctgaagc cagaaggatg actgattacg 3300 

cctcargggt ggaggggacc actcctgggc cttcgtgatt gtcaggagca agacctgaga 3360 

tgctccctgc cttcagtgtc ctctgcatct cccctttcta atgaagatcc atagaatttg 3420 

ctacatttga gaattccaat taggaactca catgttttat ctgccctatc aattttttaa 3480 

acttgctgaa aattaagttt tttcaaaatc tgtccttgta aattactttt tcttacagtg 3540 

tcttggcata ctatatcaac tttgattctt tgttacaact tttcttactc ttttatcacc 3600 

aaagtggctt ttattctctt tattattatt attttctttt actactatat tacgttgtta 3660 

ttattttgtt ctctatagta tcaatttatt tgatttagtt tcaatttatt tttattgctg 3720 

acttttaaaa taagtgattc ggggggtggg agaacagggg agggagagca ttaggacaaa 3780 

tacctaatgc atgtgggact taaaacctag atgatgggtt gataggtgca gcaaaccact 3840 

atggcacacg tatacctgtg taacaaacct acacattctg cacatgtatc ccagaacgta 3900 

aagtaaaatt taaaaaaaag tga 3923 



<210> 619 

<211> 3674 

<212> DNA 

<213> Homo sapien 



<400> 619 

agaaagtttc cttttttttt tttaatggtg aaaagatata cacatattta gaattagcca 60 

qctgggctca gtttagatta ttccaatttt gttggcaaca tccagagcat cgtaatcagg 120 

agccagtgaa acatattcct tcttctctcc atcaggccaa atcacggtgt tgaccttggc 180 

cacatcaatg tcttagaact tcttcacagc ctgtttgatc tggtgcttgt tggctttaac 240 

acccacaatg aacacaagtg tgttgttgtc ttctatcttc ttcgtggtga ctcagtggtc 300 

agcggaaact tgatgatagc gtagtggtca agcttgtatc tcctgggagc gctcttccaa 360 

agatatttgg gctgcctcgg gagttgcagc gtcttgggcc gccggaaggt gggtgacgta 420 

cggatcttct ttttttgtgt ggctgtggac acctttcaac actgtcttct tggcctttaa 480 

atccttcgct ttggtttcgg ctataggagg ggcaggagct tccttcttca ctttcggcgc 540 

catcttgtga aaagggaaag tttcctttct aataccattt tcacttctcc cgaattttgt 600 

ggatcgtttc ttggtatcta ccccagattt caggagtgtt ggctggatct tagggattgt 660 

gaagtcttca tttccctgtg gtgagatctg aggcatgatt ttaaacagtg tgagggaagg 720 

agatctccag gcactttaat agaatggaga agcaggatgg gatttgagag gaaatctgat 780 

tttgaaaaaa ggagaactag agttgagttc gtaattaact agcaccttaa aggtcattca 840 

gcatgcccat ctgcacagtg ggtgtaatca ccctacagaa caaaaacaaa aaggcaatgg 900 

agaggaagct gtaaagcact gtacatgttt aactcattgt tatgtaagct agccgaaggc 960 

ttcacagact tgaattcatc tcccaagttc tcttcctgta ctggaaactc tgccttaggt 1020 

tgcttaaaac ttgagaaaca gaatattgct tcccctgcct gccttcttga gtacacttgc 1080 

ctacacaaag atgcacatcc ttgtttgtgt gtgtgtgtcc atttgctgtg acattcttgt 1140 

gaaagtcaaa gtttcccagc tgttgacata cacaagtttg tttggtgcaa cctgtcagat 1200 

gcatccctta gacaggccct ttgatactct gggaaagaca ttggacttac agtcggaacg 1260 

aaaagaaaga aatgtgatat gtatagcgtg cagtgagttg gagttttacc tgtattgttt 1320 

taatttcaac aagcctgagg actagccaca aatgtaccca gtttacaaat gaggaaacag 1380 

gtq":aaaaag gttgttacct gtcaaaggtc gtatgtggca gagccaagat ttgagcccag 1440 

LcaLgtctga tgaacttagc ctatgctctt taaacttctg aatgctgacc attgaggata 1500 

tctaaactta gatcaattgc attttccctc caagactatt tacttatcaa tacaataata 1560 

ccci 'ctttac caatctattg ttttgatacg agactcaaat atgccagata tatgtaaaag 1620 

cjciacctacaa gctctctaat catgctcacc taaaagattc ccgggatcta ataggctcaa 1680 

agaaacttct tctagaaata taaaagagaa aattggatta tgcaaaaatt cattattaat 1740 
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ttttttcatc catcctttaa ttcagcaaac atttatctgt tgttgacttt atgcagtatg 1800 

gccttttaag gattggggga caggtgaaga acggggtgcc agaatgcatc ctcctactaa 1860 

tgaggtcagt acacatttgc attttaaaat gccctgtcca gctgggcatg gtggatcatg 1920 

cctgtaatct caacattgga aggccaaggc aggaggattg cttcagccca ggagttcaag 1980 

accagcctgg gcaacataga aagaccccat ctctcaatca atcaatcaat gccctgtctt 2040 

tgaaaataaa actctttaag aaaggtttaa tgggcagggt gtggtagctc atgcctataa 2100 

tacagcactt tgggaggctg aggcaggagg atcactttag cccagaagtt caagaccagc 2160 

ctgggcaaca agtgacacct catctcaatt ttttaataaa atgaatacat acataaggaa 2220 

agataaaaag aaaagtttaa tgaaagaata cagtataaaa caaatctctt ggacctaaaa 2280 

gtatttttgt tcaagccaaa tattgtgaat cacctctctg tgttgaggat acagaatatc 2340 

taagcccagg aaactgagca gaaagttcat gtactaacta atcaacccga ggcaaggcaa 2400 

aaatgagact aactaatcaa tccgaggcaa ggggcaaatt agacggaacc tgactctggt 24 60 

ctattaagcg acaactttcc ctctgttgta tttttctttt attcaatgta aaaggataaa 2520 

aactctctaa aactaaaaac aatgtttgtc aggagttaca aaccatgacc aactaattat 2580 

ggggaatcat aaaatatgac tgtatgagat cttgatggtt tacaaagtgt acccactgtt 2640 

aatcacttta aacattaatg aacttaaaaa tgaatttacg gagattggaa tgtttctttc 2700 

ctgttgtatt agttggctca ggctgccata acaaaatacc acagactggg aggcttaagt 2760 

aacagaaatt catttctcac agttctgggg gctggaagtc cacgatcaag gtgcaggaaa 2820 

ggcaggcttc attctgaggc ccctctcttg gctcacatgt ggccaccctc ccactgcgtg 2880 

ctcacatgac ctctttgtgc tcctggaaag agggtgtggg ggacagaggg aaagagaagg 2940 

agagggaact ctctggtgtc tcgtctttca aggaccctaa cctgggccac tttggcccag 3000 

gcactgtggg gtggggggtt gtggctgctc tgctctgagt ggccaagata aagcaacaga 3060 

aaaatgtcca aagctgtgca gcaaagacaa gccaccgaac agggatctgc tcatcagtgt 3120 

ggggacctcc aagtcggcca ccctggaggc aagcccccain agagcccatg caaggtggca 3180 

gcagcagaag aagggaattg tccctgtcct tggcacattc ctcaccgacc tggtgatgct 3240 

ggacactgcg atgaatggta atgtggatga gaatatgatg gactcccaga aaaggagacc 3300 

cagctgctca ggtggctgca aatcattaca gccttcatcc tggggaggaa ctgggggcct 3360 

ggttctgggt cagagagcag cccagtgagg gtgagagcta cagcctgtcc tgccagctgg 3420 

atccccagtc ccggtcaacc agtaatcaag gctgagcaga tcaggcttcc cggagctggt 3480 

cttgggaagc cagccctggg gtgagttggc tcctgctgtg gtactgagac aatattgtca 3540 

taaattcaat gcgcccttgt atcccttttt cttttttatc tgtctacatc tataatcact 360O 

atgcatacta gtctttgtta gtgtttctat tcmacttaat agagatatgt tatacttaaa 3660 

aaaaaaaaaa aaaa 3674 

<210> 620 

<211> 2051 

<212> DNA 

<213> Homo sapien 



<220> 

<221> inisc_f eature 
<222> (1) , . . (2051) 
<223> n = A,T,C or G 



<400> 620 

ggaccagggg ctgaagtgaa cccccagcac agcacagctg ctctataaaa acgtggccag 60 

actttttttt ttgaagcaag tccctgttct tgttcgtcct gactagtccc atcagggccc 120 

tggatcccaa gactcagcat ccaaggtccc ctccaggaat cctggcagct cagcatactt 180 

tatcctgttt catctgagag caaaaatgta aaattggatg cacagaaaag tgactcaaag 240 

tgcttaatga ctagaagaaa tctaggagca gcaagaagag caggacaaac aggccaggcg 300 

gtgtcaggag cccaggtctc cagctggang gaacgtcaac cctgcagtgg gagcaggggc 360 

cctttgcaca tcctaggcac agatggtaat gtagacacca caggtaagct gggcttggta 420 

cctacccctc cccggattca gaaagaaacc aaacaaggag ctttgtgtgg aatgaaacct 480 

Gctttcctcc cagaagcact gctgactgtt tggtggttgc catttgtggc agtgagccct 540 

tgtttgttct gaggttgggc tggtttctcc tcttggccct gccctacaga tcataaagga 600 

gaacagcaag acgtccccag caaacatcca cagatggcct tggaaataag tcaccttcct 660 

caccctgcag gaatgccagt gaacatattg ctgacatctt ggagctcagt acctcatagt 720 

gtaacggcgt cagtagatct gcctgtgctg ggacttcctg tactacccat tcctgagggg 780 
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jgatgcttct gcagggcctg tgacttggtg cacaacttca gacaccatca rcttgcagca 840 

qcaccgcacc ctcacragcc agggtgttga tgacttccrc aaggccaagg ccacattcaa 900 

KTcttcggac ttcantgatg cgcttgtgct gagcaaggtg gcttctccgg qatcttaatt 960 

jciggaggtag aatggagctt gagatcaagt gtctgatcaa gcctcagtgt atgggcgctg 1020 

f t catcctct ggtgctgaag cagccaagag acccaagtct gcctggctqc ctcttaggat 1080 

argacagcag agccagtggc ctctactaga tcctgtacaa cctcacaaaa cacccagaca 1140 

tcgggagtgc tgccagcctg tgatgcaaga gtcctaatcc tgaagacatt gaatgacctg 1200 

tcgttgtgct gtttttacca aaaaggatca tgaggatcag agaggaaaag tcacttgccc 1260 

aaagtcacac agctgaacag tggtggagtt caacttrgac cgtgggctgt ctggccccca 1320 

aggtgtatgc ttgcrtctct cccaagagac tcctttctta tcaggctcaa atgaatgaaa 1380 

qgaggatgtt aaagacaacg ccattattga cgagatcact cccaagcgga ttggagattg 1440 

tcccaatatt tagacctata gcaaggcctt gggagaaatg gtggtgcagc aggagagcag 1500 

gaacctaacc attgccatcc taaggccctc cattgtgtgg agcaacgtgg caccagcttt 1560 

tcctgggttg ggttgataat ctaaatggat gtagccgact cattattgcg gtatgtatag 1620 

ggatgaagaa gtaactgtaa tgtagtggag gaatagtaag aaaattctta gtgctggctt 1680 

agcttaattg atccaaaaac ataaatgcta ctttactatc aattgaagca tattatttca 1740 

attattctgg ttataatatg gaggcaggat gaaattgttt ttattctttt agaatttttt 1800 

tttatcagga aaacagaggt aaagtgctat caattactat ttaagagttc tattttgaaa 1860 

agtgagaatt aaggattttt cttttctttt taaaaaaaac ttttttaaaa attaaaaata 1920 

aaagaagcaa aagtcttagg aaaatgaagc aagtagccct gccactctat gtacagtaat 1980 

aacaatatct gtcccagtta ttatgtacaa tattataaaa aatgtcgcag acagtaaaaa 2040 

aaaaaaaaaa a 2051 



<210> 621 

<211> 2841 

<212> DNA 

<213> Homo sapien 

<220> 

<221> inisc_f eature 
<222> (1) . . . (2841) 
<.223> n = A, T, C or G 

<400> 621 

gcagagcaca gcatagctgc tttaccaaat 
ctgatcctgt tccacctcac tggacaggac 
cagcatccct tggccaatgg aaatttgaaa 
ggcaggccac cacctttgct gtttgggtga 
gcccaagctg acaaggggta gaagaggtgc 
atggccagac tcttgtttaa gtcagtcccc 
Ltcaaccagg gtctctggct accttgactg 
tccctgagtc agagctcccg gggggaggac 
cagccatttc agccttaggg cttcagagtg 
ccagcacagc acagctgctg tataaaaacg 
gttcttattc ctcctgacta ggtaagactt 
tgtgttcaga ttggcaacag gtttgtacct 
aggctatcat cttccctgga aaatacgagt 
tccacagcag cccttcagaa aagtggccag 
cctgtgtctc tagccagccc accactggag 
ttggacagag cttccaggag caaatgaaat 
cccttgctat cctcagaaga tatatcacgg 
ctccaataag ctgcagttga cccaaagaac 
dc^actccact actcatcacc agacagggaa 
catcctgggc cgattacact gagtgattgc 
aggagacaag caaagtggtg gagcagcaag 
aagctatctc tctgggctcc acttgccctt 
ctgcttgcct ctcccagggc cccagcctgg 
c:ccataccat agtttctgtg ctagtggacc 



catggccaga ctgcttctgt aagcaggccc 60 

ctcccaactg gggcctccag ctacccccac 120 

tgttcctggg acagagctcc tggagagagg 180 

ctagccgttc tggcctgcag gctttggaga 240 

ctcagcacag cacagccacg ctacgaaaac 300 

gaacacattt ctagtcagtg ggtgaagtct 360 

ctgttctctg gccgacagag gtctcaggcc 420 

cagattgtca tctttgctgt ttgggtgacc 480 

tctgaggtag ccaggggctg aagtgaaccc 540 

tggccagact ttttctttaa gcaagtccct 600 

ctcaacttgc ctccagccac atcttattgg 660 

cagtggtaca gagctcccag aggaaggggt 720 

caattaggga cttgagggga cccccagcat 780 

actctgtact tgatgggcag atcctcctgg 840 

ctatcaagcc agtagcaact cagcagttcc 900 

cctttctgcc actgcctttg cagtgaactg 960 

gagcaaagac cctaagtgcc atatcaacac 1020 

aagccaatcc atctcccaca ggttccacac 1080 

ccctggcttg ggcccacagc acagaccctc 1140 

taactcacat gtctctggga tggagcaccc 1200 

tcaggtgatg tggagcccag agggcaggga 1260 

gtgagacact ttgtcccagc actccttagt 1320 

ccacacctgc ttacagggca ctctcagatg 1380 

gtaccatatc agtggagagc tgcagcaagg 1440 
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tggcccntac ggccacgcac cagcctgcac attacctctc catactgcag ccctttatat 1500 

ggaaacttcc tacatcactt tgctgtgtgt gtttacacag gtggattttg ctttacttgc 1560 

actgacagca cacaggaggg cagcacacac cccaacccac atcaactgcc attaaagaaa 1620 

agaaatttca gcccataatt tcatgtccag caaaattagg catcataagt gaaggagaaa 1680 

taagatcctt ttcagacaag caaatgctga gggaattcaa tatcaccaga tctaccttac 1740 

aagagctcct gaaggaagca ctaaatatgg aaagaaaaaa ccatcaccag ccactacaaa 180O 

aatgcagtga agaacgcagt gaattacgca gtccagtgat gctaaaaacc aaccacatac 1860 

gttaagtctg caaaataacc agctgacagc atgacgacag gataaatcca cacataccat 1920 

tactaacctt aaatgaaaat gggctaaatg ctcccattga aagacatggg gcaagctgga 1980 

taaagaacca agacccactg gagtatgctg tcttcaagaa acccatctca catgcggtgg 2040 

catacatagg ctcaaaataa aggaatggag aaaaatattt caagcaaatg gaaaacagaa 2100 

aaaagcaggt gttgcactcc tactttctga caaaacagac tatgcgaata aagataaaaa 2160 

agagaaggac attacaaagg tggtcctgac ctttgatata tctcattgct tgataccaac 2220 

ctgggctgtt ttaattgccc aaanccaata ggataatttg ctgaggttgt ggagcttctc 2280 

ccctgcagag agtccctgat ctcccaaaat ttggttgaga tgtaaggttg attttgctgt 2340 

acaactcctt ttctgaagtt ttactcattt ccaaaaagga aggcaagttt tcctgcttcc 24 OO 

atgacgatgg agagcaggca tctcctttcc tgagtttcag cttgcttctg acagggaagg 24 60 

tgagtgtaag ttttttccag cttctaagat ggcagagaac gatcaccagc ctgagcctta 2520 

tttccaggta agtagctgaa ttagagtttt gtcttaaaat ttttccttaa tgattaaaat 2580 

gtaagattac ccaccagctg cttttaattt ctcccttagc attagaacac tcagtaatca 2640 

tatgaattgt gcatttgttt gttttgctta actctttctg tttgtttatg tttggggttt 2700 

tattgttgtt gtttcacttt tctcccatct cttcctgact tggtcaaatc caaaggaatg 27 60 

ttcgaaattg tggggagcaa ggcatctgaa atggctaaaa ctcctgtggc tgcaaaaaat 2820 

agaaataaaa aaaaaaaaaa a 2841 

<210> 622 

<211> 3228 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) ... (5228 ) 
<223> n = A,T,C or G 

<400> 622 

tccgccccat tgacgcaaat ggcggtaggc gtgtacggtg ggaggtctat ataagcagag 60 

ctctcnggct aactagagaa cccactgctt actggcttat cgaaattaat acgactcact 120 

atagggagac ccaagctggc tagcgtttaa acttaagctt ggtaccgagc tcggatccac 180 

tagtccagtg tggtggaatt ccattgtgtt gggcaggaaa caagcaaagt ggtggagcag 240 

caagtcaggt gatgtggagc ccagaggtca gggatggctg tctctctagg gtccacttgc 300 

ccttgtgaga cactttatcc cagcacttta ggaatactga ggtcatacca gccacatctt 360 

atatgcaaga ttgcccagca gagatcaggt ccgagagttc cctttttaaa aaaaggagac 420 

ttgcttaata aaagaagtct agccacgttt gtgtagagcg gctgtgctgt gctgggggtt 480 

cacttttgag agagttctcc tctgagacct gatctctgga ggctgggcaa tcttgcactt 540 

gagatggggc tggtctgatc tcagcactcc ttagtctgct cgcctctccc atggccccag 600 

cctggccaca cctgcttacg gggcactctt agatgcccac accataactt ccatgctagt 660 

ggactgtacc atatcagtgg agagctgcag caaggtggcc cctagagcca cgcaccagcc 720 

tgcacattgc ctctccatac ggcagccctt tatttggaaa cttcctaaat cactttgctg 780 

tgtgtgttta cacgggtgtg ttttgcttta cttgccctga gagcacacgg gagtgcagca 840 

cacaccccaa cccacatcaa ctgccattaa agaaaagaaa tttcagccca gaatttcatg 900 

tccagcaaaa ttaagcatca taagtgaagg agaaataaga tccttttcag acaagcaagt 960 

gctgagggaa tttggtatca ccagatctac cttacgagag ctcctgaagg aagcactaaa 1020 

tatggaaaga aaagatcatc acctgctact acaaaaacac actgaagtac acagtccaat 1080 

gatgctaaaa agcaagcaca tatgtaagtc tgcaaaataa ccagctgaca gcatgacgac 1140 

aggataaaat ccacacatac cattactaac cttaaatgta aatgggctaa atgctcccat 1200 

tgaaagacac ggggcaagct gggtaaagaa ccaagaccca ctggagtatg ccgtcttcaa 1260 

gcaacccatc tcacgtgcag tgccatacat aggctcaaaa taaaggaatg gagaaaaata 1320 
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t t caagcaa 
tctaccaata 
atcLcattat 
t t gctgaggt 
agatgtaagg 
ggaaggcaag 
: cagcttgcr 
gacgtgcatt 
gagt gagaaa 
agttcaggga 
gcaagaggtc 
tgcaggaatg 
cggcgtcagc 
gcttctgcag 
cgcaccctca 
tcggacttca 
aggtagaatg 
tent ctggtg 
cagcagagcc 
gagtgctgcc 
tctgctgttt 
tcacacagct 
t at gctt get 
at gttaaagg 
t gtgttctca 
ctttagtgag 
aagagagatt 
tttctttact 
aattttcatg 
ttttntttta 
gataatacat 
act aagtcct 



atggaaaaca 
aagat aaaaa 
tgcttgatac 
tgtggagctt 
ttgattttgc 
1 1 ttcctgct 
tctgacaggg 
gggcctgtgg 
gcttacccaa 
ctccttggca 
cccggcaaac 
ccagtgaact 
agatctgcct 
ggcctgtgac 
ctagccaggg 
ttgatgcgct 
gagcttgaga 
ctgaagcagc 
agtggcctct 
agcctgtgat 
ttaccaaaaa 
gaacagtggt 
tct ctcccaa 
taggatctct 
ttcttacctc 
atcatatatt 
ttcacagaaa 
gctcataaaa 
ttaaaattga 
gaattttcca 
acaataaccc 
aattttgtat 



gaaaaaaggt 
aagagaagga 
caacctgogc 
ctccccttca 
tgt acaactc 
tccattgaca 
aaggagcttt 
tcccat ttgt 
tgcctgtacc 
gaagagaccg 
atccacagat 
tattgctgac 
gtgctgggac 
ttggtgcaca 
tgttgatgac 
tgtgctgagc 
tcaagtgtct 
caagagaccc 
actagatcct 
gcaagagtcc 
ggatcatgag 
ggagttcaac 
gagacaactt 
gaagcctgtg 
attaagagta 
at tagtaaga 
cagatatata 
attagtcctt 
cagat acatt 
ttattaatgt 
tttgtttttc 
taaaatttta 



gttgcact cc 
cattacaaag 
tatttgtatt 
cagagt ccct 
cttttttgaa 
aaggagagca 
gagattt gaa 
gttatttttc 
atcatcgt ac 
tagccttgta 
ggccttggaa 
atctt ggagc 
ttcctgtact 
acttcagaca 
ttcct caagg 
aaggtggctt 
gatcaagcct 
aagtctgcct 
gtacaacctc 
taatcctgaa 
gatcagagag 
tttgaccgtg 
tcttatcagg 
ccagtggaac 
aagtttattg 
actgggacca 
cctgtaagta 
attagaatgt 
tttaaattgt 
tatttttatg 
aaattgaaaa 
aattttcaaa 



cagttt ctga 
gtggtcctga 
gcccaaacga 
gatct ccgaa 
gtt "c t actca 
ggcacct cct 
tact ggcctg 
t gggaaattt 
cttaaaagaa 
tcagatcata 
ataagtcacc 
t cagtaccct 
acccattcct 
ccatcat ctt 
ccaaggccac 
ctccgggatc 
cagtgtatgg 
ggctgcntct 
acaaaacacc 
gacat tgaat 
gaaaagt cac 
ggctgtctga 
ct caaatgaa 
cgcagct cat 
agtttattga 
aacagatttt 
tacagacacg 
gggatgtata 
cctaaaataa 
agaaactata 
tacagtgtat 
aaaaaaaa 



caaaacagac 
cctttgataa 
at aggat aar 
aattt ggttg 
t ttccaacaa 
t t cctgagtt 
ctgggt t tt g 
cttccct ttg 
ctccatttta 
aaggagaaga 
ttgct caccc 
catagtgtaa 
gaggggcgat 
gcagcagcac 
attcaaggct 
ttaatt cagg 
gcgctgttca 
taggat atga 
cagacat egg 
gacctgt cat 
ttgcccaaag 
ccccaaggtg 
tgaaaggagg 
ggctggcacc 
atttaagtat 
ctgact ctaa 
catacacaca 
aatgtaagag 
atttaattat 
taactttatt 
tttgcaaata 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3228 



<210> 623 

<211> 4894 

<212> DNA 

<213> Homo sapiens 



<400> 623 

ctgcacgcgc 

gtgtccccac 

ggcaccaaag 

aggaggagag 

acggccccct 

ctgctgcggc 

gtgtgtttgg 

gagaagttca 

at acatgcca 

gaggtggtct 

accaagagaa 

gcacagagcc 

ttataattta 

tt acagagaa 

gragctgaac 

cct aggacaa 

zii ■h z^i ^ 

uCi»-( i^t-ivj \^ d d ^ 

gcaggctctg 



tggct ccggg 
tgaggtgccc 
ggctggcaga 
gccgcagctt 
gagccctacc 
accggaaagc 
ccgcaggcat 
tgaccatggt 
cctggcatgt 
gctgggaagg 
agaagagaat 
tggct catag 
tagcgattat 
ggaagcagag 
agtagagctg 
ggggtggact 
cattgtgtga 
gagtcacact 



tgacagccgc 
cacagcagca 
aatgggcgcc 
ctggagcaga 
cgcctggccc 
ccagctcttg 
cacctatgtg 
■gctgggtgag 
gggacaggag 
cattgctggg 
accaaggaca 
gagacactgg 
ctcatttaat 
gcttt t aaga 
ggact ccatc 
cctgactctg 
caacccctca 
gcttgttgaa 



gcgcct cggc 
ggtgttgagc 
tggctgattc 
gccgagacga 
actatggtcc 
ctggtcaacc 
ccgcctctgc 
tcactacat c 
tacctctgcc 
caggagggtg 
taccccagtc 
agaaatgctc 
gcttacaacc 
ggtt aggt aa 
aaggtctccc 
cagataaatt 
cccccaggaa 
acgctgcctc 



caggatctga 
atgggctgag 
ctaggcagtt 
agcagttctg 
agaggctgtg 
tgctaacctt 
tgct ggaagt 
ctccttcctt 
ctgggagctg 
accctgggct 
acctctggat 
ctaacctttg 
accatttgag 
gtcttagcca 
agccggagct 
ctacaaaagc 

'^y gy y ^^^^ 

ttaccctccc 



gtgatgagac 
aagctggacc 
ggcggcagca 
gagtgcctga 
ggtgagccgc 
tggcctggag 
gggggtagag 
cctgttccag 
cttggaggga 
gagggggcac 
ccctggt cct 
gctagccctt 
gtgatccatt 
aagccaaata 
tgctcct acc 
cacagaaggc 

taggtctgcg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 
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cctttgaata agtatcactt cttagttgct 
gggcatctgt aatgcctgtg ttatgaggag 
acagtgtcga gtacggaatg ttatttccat 
ttgcccttta cttgtcccag ccattgactc 
cagtgctggg cctggtctgt gtcccgctcc 
gctatggccg ccgccggccc ttcatctggg 
ttctcatccc aagggccggc tggctagcag 
agctggcact gctcatcctg ggcgtggggc 
ctccactgga ggccctgctc tctgacctct 
actctgtcta tgccttcatg atcagtcttg 
ttgactggga caccagtgcc ctggccccct 
gcctgctcac cctcatcttc ctcacctgcg 
cagcgctggg ccccaccgag ccagcagaag 
gctgtccatg ccgggcccgc ttggctttcc 
accagctgtg ctgccgcatg ccccgcaccc 
gctggatggc actcatgacc ttcacgctgt 
accagggcgt gcccagagct gagccgggca 
aggccttggc agccagcaga ggctggtgtg 
ctgtgtctgg gctggtgcct ctccatcctg 
tggaccccat ctgcatacac ggcttctcat 
ggaaggcctc tggctgctct aggagtctga 
aaaggcggag cttattcaaa gtctagaggg 
tgcctggttc cagccgcagt gtgccctctg 
cagcgccttc cagctcaggc gtcctagaag 
tgttccctct cacccgcctg tcctcacagc 
tcctctgcct tcagcaaggg gcgttgccca 
actcccattg ctagaggtag aaaggggaag 
ggtctcgtgc agcaggtacc tgtggttccg 
cttccctccc aggctctgtc tgatggcccc 
ctggggctgt tcctgcagtg cgccatctcc 
gtgcagcgat tcggcactcg agcagtctat 
gccggtgcca catgcctgtc ccacagtgtg 
gggttcacc"t tctcayccct gcagatcctg 
gagaagcagg tgttcctgcc caaataccga 
agcctgatga ccagcttcct gccaggccct 
gtgggtgytg gaggcagtgg cctgctccca 
tgtgatgtct ccgtacgtgt ggtggtgggt 
cggggcatct gcctggacct cgccatcctg 
ccatccctgt ttatgggctc cattgtccag 
tctgccgcag gcctgggtct ggtcgccatt 
agcgacttgg ccaaatactc agcgtagaaa 
ctcactgggt cccagctccc tgctcctgtt 
gtttctgttg ctgccaaagt aatgtggctc 
tagctgcaca gctgggggct ggggcgtccc 
tgactggagg ccttccaagg gggtttcagt 
ccatgcactg gaatgcgggg actctgcagg 
gcctcctagt tgagacacac ctagagaagg 
tggtttccca tctctaagcc ccttaacctg 
agtttctagg atgaaacact ccaccatggg 
aagagtcctg aggggcaaca cacaagaacc 
tgctgatcca cccccctctt accttttatc 
atcacagaga cacaggcatt taaatattta 
ccattgctag cttttctgtg ttggtgtcta 
atcaggtccc ctgagatagc tggtcattgg 
gggtctggcc ccccaaaatg cctaacccag 
taattccaaa tgctgttacc caaggttagg 
aggtctcaac ggcttcccta accacccctc 
tccactcccc tctactctct ctaggactgg 



ccatgcctca gtttgtccat ctgaaaatgg 1140 
taaattacag catccctgtg aagacgtagc 1200 
ccttctcacg gagcttggtt ccccttcccc 1260 
atactacttc ccttcttgca ggcattggtc 1320 
taggctcagc cagtgaccac tggcgtggac 1380 
cactgtcctt gggcatcctg ctgagcctct 1440 
ggctgctgtg cccggatccc aggcccctgg 1500 
tgctggactt ctgtggccag gtgtgcttca 1560 
tccgggaccc ggaccactgt cgccaggcct 1620 
ggggctgcct gggctacctc ctgcctgcca 1680 
acctgggcac ccaggaggag tgcctctttg 1740 
tagcagccac actgctggtg gctgaggagg 1800 
ggctgtcggc cccctccttg tcgccccact 1860 
ggaacctggg cgccctgctt ccccggctgc 1920 
tgcgccggct cttcgtggct gagctgtgca 1980 
tttacacgga tttcgtgggc gaggggctgt 2040 
ccgaggcccg gagacactat gatgaaggta 2100 
ggagccgccc accagagacg acactcgggg 2160 
gccccgactt ctctgtcagg aaagtgggga 2220 
gggtgtggaa catctctgct tgcggtttca 2280 
tcagagtcgt tgccccagtt tgacagaagg 2340 
agtggaggag ttaaggctgg atttcagatc 2400 
ctcccccaac gactttccaa ataatctcac 2460 
cgtcttgaag cctatggcca gctgtctttg 2520 
tgagactccc aggaaacctt cagactacct 2580 
cattctctga gggtcagtgg aagaacctag 2640 
ggtgctgggg agcagggctg gtccacagca 2700 
ccttctcatc tccctgagac tgctccgacc 2760 
tctccctctg caggcgttcg gatgggcagc 2820 
ctggtcttct ctctggtcat ggaccggctg 2880 
ttggccagtg tggcagcttt ccctgtggct 294 0 
gccgtggtga cagcttcagc cgccctcacc 3000 
ccctacacac tggcctccct ctaccaccgg 3060 
ggggacactg gaggtgctag cagtgaggac 3120 
aagcctggag ctcccttccc taatggacac 3180 
Gctccacccg cgctctgcgg ggcctctgcc 3240 
gagcccaccg aggccagggt ggttccgggc 3300 
gatagtgcct tcctgctgtc ccaggtggcc 3360 
ctcagccagt ctgtcactgc ctatatggtg 3420 
tactttgcta cacaggtagt atttgacaag 3480 
acttccagca cattggggtg gagggcctgc 354 0 
agccccatgg ggctgccggg ctggccgcca 3600 
tctgctgcca ccctgtgctg ctgaggtgcg 3660 
tctcctctct ccccagtctc tagggctgcc 3720 
ctggacttat acagggaggc cagaagggct 378 0 
tggattaccc aggctcaggg ttaacagcta 3840 
gtttttggga gctgaataaa ctcagtcacc 3900 
cagcttcgtt taatgtagct cttgcatggg 3960 
atttgaacat atgaaagtta tttgtagggg 4020 
aggtcccctc agcccacagc actgtctttt 4080 
aggatgtggc ctgttggtcc ttctgttgcc 414 0 
acttatttat ttaacaaagt agaagggaat 4200 
atatttgggt agggtggggg atccccaaca 4260 
gctgatcatt gccagaatct tcttctcctg 4320 
gaccttggaa attctactca tcccaaatga 4380 
gtgttgaagg aaggtagagg gtggggcttc 444 0 
ttctcttggc ccagcctggt tccccccact 4500 
gctgatgaag gcactgccca aaatttcccc 4560 
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Ucicccccaac tttcccctac ccccaacrtt ccccaccaqc tccacaaccc tgtrtggagc 4620 

tacrqcaqga ccagaagcac aaagtgcggt ttcccaagcc tttgtccatc tcagccccca 4 680 

t;. ::.tatct gtgcttgggg aatctcacac agaaactcag gagcaccccc tgcctgagct 4740 

a:,gggaggtc ttatctctca gggggggttt aagtgccgtt tgcaataatg tcgtcttatt 4800 

* 01 t -aqcgg ggtgaatatt ttatactgta agtgagcaat cagagtataa tgtttatggt 4860 

gacaaaatta aaggctttct taratgttta aaaa 4894 



■ :;:o> 62 4 

<::.: L> 2904 
:2i2> DNA 

<2i3> Homo sapiens 
<400> 624 

gtctatgcct tcatgatcag tcttgggggc tgcctgggct acctcctgcc tgccattgac 60 
tgggacacca gtgccctggc cccctacctg ggcacccagg aggagtgcct ctttggcctg 120 
ctcdccctca tcttcctcac ctgcgtagca gccacactgc tggtggctga ggaggcagcg 180 
ctgggcccca ccgagccagc agaagggctg tcggccccct ccttgtcgcc ccactgctgt 240 
ccatgccggg cccgcttggc tttccggaac ctgggcgccc tgcttccccg gctgcaccag 300 
ctgtgctgcG gcatgccccg caccctgcgc cggctcttcg tggctgagct gtgcagctgg 360 
atggcactca tgaccttcac gctgttttac acggatttcg tgggcgaggg gctgtaccag 420 
qgc:gtgccca gagctgagcc gggcaccgag gcccggagac actatgatga aggaaggcct 480 
c:lggctgctc taggagtctg atcagagtcg ttgccccagt ttgacagaag gaaaggcgga 54 0 
gcttattcaa agtctagagg gagtggagga gttaaggctg gatttcagat ctgcctggtt 600 
ccagccgcag tgtgccctct gctcccccaa cgactttcca aataatctca ccagcgcctt 660 
ccagctcagg cgtcctagaa gcgtcttgaa gcctatggcc agctgtcttt gtgttccctc 720 
tcacccgcct gtcctcacag ctgagactcc caggaaacct tcagactacc ttcctctgcc 780 
ttcagcaagg ggcgttgccc acattctctg agggcgttcg gatgggcagc ctggggctgt 84 0 
tcctgcagtg cgccatctcc ctggtcttct ctctggtcat ggaccggctg gtgcagcgat 900 
tcggcactcg agcagtctat ttggccagtg tggcagcttt ccctgtggct gccggtgcca 960 
catgcctgtc ccacagtgtg gccgtggtga cagcttcagc cgccctcacc gggttcacct 1020 
tctcagccct gcagatcctg ccctacacac tggcctccct ctaccaccgg gagaagcagg 1080 
t.gttcctgcc caaataccga ggggacactg gaggtgctag cagtgaggac agcctgatga 1140 
ccagcttcct gccaggccct aagcctggag ctcccttccc taatggacac gtgggtgctg 1200 
gaggcagtgg cctgctccca cctccacccg cgctctgcgg ggcctctgcc tgtgatgtct 1260 
Gcgtacgtgt ggtggtgggt gagcccaccg aggccagggt ggttccgggc cggggcatct 1320 
gcctggacct cgccatcctg gatagtgcct tcctgctgtc ccaggtggcc ccatccctgt 1380 
ttatgggctc cattgtccag ctcagccagt ctgtcactgc ctatatggtg tctgccgcag 1440 
gcctgggtct ggtcgccatt tactttgcta cacaggtagt atttgacaag agcgacttgg 1500 
ccaaatactc agcgtagaaa acttccagca cattggggtg gagggcctgc ctcactgggt 1560 
r.ccagctccc cgctcctgtt agccccatgg ggctgccggg ctggccgcca gtttctgttg 1620 
ctgccaaagt aatgtggctc tctgctgcca ccctgtgctg ctgaggtgcg tagctgcaca 1680 
gctgggggct ggggcgtccc tctcctctct ccccagtctc tagggctgcc tgactggagg 1740 
ccttccaagg gggtttcagt ctggacttat acagggaggc cagaagggct ccatgcactg 1800 
yaatgcgggg actctgcagg tggattaccc aggctcaggg ttaacagcta gcctcctagt 1860 
tgagacacac ctagagaagg gtttttggga gctgaataaa ctcagtcacc tggtttccca 1920 
tctctaagcc ccttaacctg cagcttcgtt taatgtagct cttgcatggg agtttctagg 1980 
atgaaacact cctccatggg atttgaacat atgaaagtta tttgtagggg aagagtcctg 2040 
aggggcaaca cacaagaacc aggtcccctc agcccacagc actgtctttt tgctgatcca 2100 
cccccctctt accttttatc aggatgtggc ctgttggtcc ttctgttgcc atcacagaga 2160 
cacaggcatt taaatattta acttatttat ttaacaaagt agaagggaat ccattgctag 2220 
cttttctgtg ttggtgtcta atatttgggt agggtggggg atccccaaca atcaggtccc 2280 
ctgagatagc tggtcattgg gctgatcatt gccagaatct tcttctcctg gggtctggcc 2340 
ccccaaaatg cctaacccag gaccttggaa attctactca tcccaaatga taattccaaa 2400 
tgctqttacc caaggttagg gtgttgaagg aaggtagagg gtggggcttc aggtctcaac 24 60 
CKT.-M'.cccta accacccctc ttctcttggc ccagcctggt tccccccact tccactcccc 2520 
L c cii etc t cLaggactgg gctgatgaag gcactgccca aaattitcccc tacccccaac 2580 
1 ttcccctac ccccaacttt ccccaccagc tccacaaccc tgtttggagc tactgcagga 2640 
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ccagaagcac aaagtgcggt ttcccaagcc 
gtgcttgggg aatctcacac agaaactcag 
ttatctctca gggggggttt aagtgccgtt 
ggtgaatatt ttatactgta agtgagcaat 
aaggctttct tatatgttta aaaa 



tttgtccatc tcagccccca gagtatatct 2700 
gagcaccccc tgcctgagct aagggaggtc 2760 
tgcaataatg tcgtcttatt tatttagcgg 2820 
cagagtataa tgtttatggt gacaaaatta 2880 

2904 



<210> 625 

<211> 4034 

<212> DNA 

<213> Homo sapiens 

<400> 625 

aaccagcctg cacgcgctgg ctccgggtga cagccgcgcg cctcggccag gatctgagtg 60 
atgagacgtg tccccactga ggtgccccac agcagcaggt gttgagcatg ggctgagaag 120 
ctggaccggc accaaagggc tggcagaaat gggcgcctgg ctgattccta ggcagttggc 180 
ggcagcaagg aggagaggcc gcagcttctg gagcagagcc gagacgaagc agttctggag 240 
tgcctgaacg gccccctgag ccctacccgc ctggcccact atggtccaga ggctgtgggt 300 
gagccgcctg ctgcggcacc ggaaagccca gctcttgctg gtcaacctgc taacctttgg 360 
cctggaggtg tgtttggccg caggcatcac ctatgtgccg cctctgctgc tggaagtggg 420 
ggtagaggag aagttcatga ccatggtgct gggcattggt ccagtgctgg gcctggtctg 480 
tgtcccgctc ctaggctcag ccagtgacca ctggcgtgga cgctatggcc gccgccggcc 540 
cttcatctgg gcactgtcct tgggcatcct gctgagcctc tttctcatcc caagggccgg 600 
ctggctagca gggctgctgt gcccggatcc caggcccctg gagctggcac tgctcatcct 660 
gggcgtgggg ctgctggact tctgtggcca ggtgtgcttc actccactgg aggccctgct 720 
ctctgacctc ttccgggacc cggaccactg tcgccaggcc tactctgtct atgccttcat 780 
gatcagtctt gggggctgcc tgggctacct cctgcctgcc attgactggg acaccagtgc 840 
cctggccccc tacctgggca cccaggagga gtgcctcttt ggcctgctca ccctcatctt 900 
cctcacctgc gtagcagcca cactgctggt ggctgaggag gcagcgctgg gccccaccga 960 
gccagcagaa gggctgtcgg ccccctcctt gtcgccccac tgctgtccat gccgggcccg 1020 
cttggctttc cggaacctgg gcgccctgct tccccggctg caccagctgt gctgccgcat 1080 
gccccgcacc ctgcgccggc tcttcgtggc tgagctgtgc agctggatgg cactcatgac 1140 
cttcacgctg -ctttacacgg atttcgtggg cgaggggctg taccagggcg tgcccagagc 1200 
tgagccgggc accgaggccc ggagacacta tgatgaaggt aaggccttgg cagccagcag 1260 
aggctggtgt gggagccgcc caccagagac gacactcggg gctgtgtctg ggctggtgcc 1320 
tctccatcct ggccccgact tctctgtcag gaaagtgggg atggacccca tctgcataca 1380 
cggcttctca tgggtgtgga acatctctgc ttgcggtttc aggaaggcct ctggctgctc 1440 
taggagtctg atcagagtcg ttgccccagt ttgacagaag gaaaggcgga gcttattcaa 1500 
agtctagagg gagtggagga gttaaggctg gatttcagat ctgcctggtt ccagccgcag 1560 
tgtgccctct gctcccccaa cgactttcca aataatctca ccagcgcctt ccagctcagg 1620 
cgtcctagaa gcgtcttgaa gcctatggcc agctgtcttt gtgttccctc tcacccgcct 1680 
gtcctcacag ctgagactcc caggaaacct tcagactacc ttcctctgcc ttcagcaagg 1740 
ggcgttgccc acattctctg agggtcagtg gaagaaccta gactcccatt gctagaggta 1800 
gaaaggggaa gggtgctggg gagcagggct ggtccacagc aggtctcgtg cagcaggtac 1860 
ctgtggttcc gccttctcat ctccctgaga ctgctccgac ccttccctcc caggctctgt 1920 
ctgatggccc ctctccctct gcaggcgttc ggatgggcag cctggggctg ttcctgcagt 1980 
gcgccatctc cctggtcttc tctctggtca tggaccggct ggtgcagcga ttcggcactc 2040 
gagcagtcta tttggccagt gtggcagctt tccctgtggc tgccggtgcc acatgcctgt 2100 
cccacagtgt ggccgtggtg acagcttcag ccgccctcac cgggttcacc ttctcagccc 2160 
tgcagatcct gccctacaca ctggcctccc tctaccaccg ggagaagcag gtgttcctgc 2220 
ccaaataccg aggggacact ggaggtgcta gcagtgagga cagcctgatg accagcttcc 2280 
tgccaggccc taagcctgga gctcccttcc ctaatggaca cgtgggtgct ggaggcagtg 2340 
gcctgctccc acctccaccc gcgctctgcg gggcctctgc ctgtgatgtc tccgtacgtg 2400 
tggtggtggg tgagcccacc gaggccaggg tggttccggg ccggggcatc tgcctggacc 2460 
tcgccatcct ggatagtgcc ttcctgctgt cccaggtggc cccatccctg tttatgggct 2520 
ccattgtcca gctcagccag tctgtcactg cctatatggt gtctgccgca ggcctgggtc 2580 
tggtcgccat ttactttgct acacaggtag tatttgacaa gagcgacttg gccaaatact 2640 
cagcgtagaa aacttccagc acattggggt ggagggcctg cctcactggg tcccagctcc 2700 
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:^cgctcctgt tagccccatg gggctgccgg gctggccgcc agtttctgrt gctgccaaag 2760 
taatgtggct ctctgctgcc accctgtgct gctgaggtgc gtagctgcac agctgggggc 2820 
-agogcgtcc ctctcctctc tccccagtct ctagggctgc ctgactggag gcctrccaag 2880 
iggqtttcag tctggactta tacagggagg ccagaagggc tccatgcact ggaatgcggg 2940 
qactctgcag gtggattacc caggctcagg gttaacagct agcctcctag ttgagacaca 3000 
cctagagaag ggtttttggg agcrgaataa actcagtcac ctggtttccc atctctaagc 3060 
-ccttaacct gcagcttcgt ttaatgtagc tcttgcatgg gagtttctag gatgaaacac 3120 
tcccccatgg gatttgaaca tatgaaagtt atttgtaggg gaagagtcct gaggggcaac 3180 
:-icaGaagaac caggtcccct cagcccacag cactgtcttt ttgctgatcc aocccccztct 3240 
L^.'ccttttat caggatgtgc ctgttggtcc ttctgttgcc atcacaqaga cacaggcatt 3300 
caaatattta acttatttat ttaacaaagt agaagggaat ccattgctag cttttctgtg 3360 
ttggtgtcta atatttgggt agggtggggg atccccaaca atcaggtccc ctgagatagc 3420 
tggtcattgg gctgatcatt gccagaatct tcttctcctg gggtctggcc ccccaaaatg 3480 
cctaacccag gaccttggaa attctactca tcccaaatga taattccaaa tgctgttacc 3540 
caaggttagg gtgttgaagg aaggtagagg gtggggcttc aggtctcaac ggcttcccta 3600 
accacccctc ttctcttggc ccagcctggt tccccccact tccactcccc tctactctct 3660 
ctaggactgg gctgatgaag gcactgccca aaatttcccc tacccccaac tttcccctac 3720 
ccccaacttt ccccaccagc tccacaaccc tgtttggagc tactgcagga ccagaagcac 3780 
aaagtgcggt ttcccaagcc tttgtccatc tcagccccca gagtatatct gtgcttgggg 3840 
aatctcacac agaaactcag gagcaccccc tgcctgagct aagggaggtc ttatctctca 3900 
gggggggttt aagtgccgtt tgcaataatg tcgtcttatt tatttagcgg ggtgaatatt 3960 
ttatactgta agtgagcaat cagagtataa tgtttatggt gacaaaatta aaggctttct 4020 
tatatgttta aaaa 4034 



<210> 626 

<211> 6976 

<212> DNA 

<213> Homo sapiens 



<400> 626 

gaagctggac 

rggcggcagc 

ggagtgcctg 

gggtgagccg 

tt ggcctgga 

I gggggtaga 

tcctgttcca 

gcttggaggg 

tgagggggca 

tcGCtggtcc 

ggctagccct 

ggtgatccat 

aaagccaaat 

ttgctcctac 

ccacagaagg 

or gtgaggat 

ctaggt ctgc 

tctgaaaatg 

gaagacgtag 

tccccttccc 

aggcattggt 

■':i:qgcgtgga 

gctgagcct c 

caggcccctg 

ggt gtgcttc 

cct gcctgcc 



cggcaccaaa 
aaggaggaga 
aacggccccc 
cctgctgcgg 
ggtgtgtttg 
ggagaagttc 
gatacatgcc 
agaggtggtc 
caccaagaga 
tgcacagagc 
tttataattt 
tttacagaga 
agcagctgaa 
ccctaggaca 
caagtagtaa 
tgcaggctct 
gcctttgaat 
ggggcatctg 
cacagtgtcg 
cttgcccttt 
ccagtgctgg 
cgct atggcc 
tttctcatcc 
gagctggcac 
act ccactgg 
tactctgtct 
attgactggg 



gggctggcag 
ggccgcagct 
tgagccctac 
caccggaaag 
gccgcaggca 
atgaccatgg 
acctggcatg 
tgctgggaag 
aagaagagaa 
ctggctcata 
atagcgatta 
aggaagcaga 
cagtagagct 
aggggtggac 
ccattgtgtg 
ggagtcacac 
aagtatcact 
taatgcctgt 
agtacggaat 
acttgtccca 
gcctggtctg 
gccgccggcc 
caagggccgg 
tgctcatcct 
aggccctgct 
atgccttcat 
acaccagtgc 



aaatgggcgc 
tctggagcag 
ccgcctggcc 
cccagctctt 
tcacctatgt 
tgctgggtga 
tgggacagga 
gcattgctgg 
taccaaggac 
ggagacactg 
tctcatttaa 
ggcttttaag 
gggactccat 
tcctgactct 
acaacccct c 
tgcttgttga 
tmttagttgc 
gttatgagga 
gttatttcca 
gccattgact 
tgtcccgctc 
cttcatctgg 
ctggctagca 
gggcgtgggg 
ct ctgacct c 
gatcagtctt 
cctggccccc 



ctggctgatt 
agccgagacg 
cactatggtc 
gctggtcaac 
gccgcctctg 
gtcactacat 
gtacctctgc 
gcaggagggt 
ataccccagt 
gagaaatgct 
tgcttacaac 
aggttaggta 
caaggtctcc 
gcagataaat 
acccccagga 
aacgctgcct 
t ccatgcctc 
gtaaattaca 
tccttctcac 
catactactt 
ctaggctcag 
gcactgtcct 
gggctgctgt 
ctgctggact 
ttccgggacc 
gggggctgcc 
tacctgggca 



cctaggcagt 
aagcagttct 
cagaggctgt 
ctgctaacct 
ctgctggaag 
cctccttcct 
cctgggagct 
gaccctgggc 
cacctctgga 
cctaaccttt 
caccatttga 
agtcttagcc 
cagccggagc 
tctacaaaag 
agaggggccc 
cttaccctcc 
agtttgtcca 
gcatccctgt 
ggagcttggt 
cccttcttgc 
ccagtgacca 
tgggcatcct 
gcccggatcc 
tctgtggcca 
cggaccactg 
tgggctacct 
cccaggagga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

15 60 

1620 
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gtgcctcttt qgcctgctca ccctcatctt cctcacctgc gtagcagcca cactgctggt 1680 
ggctgaggag gcagcgctgg gccccaccga gccagcagaa gggctgtcgg ccccctcctt 1740 
gtcqccccac tgctgtccat gccgggcccg cttggctttc cggaacctgg gcgccctgct 1800 
tccccggctg caccagctgt gctgccgcat gccccgcacc ctgcgccggc tcttcgtggc 1860 
tgagctgtgc agctggatgg cactcatgac cttcacgctg ttttacacgg atttcgtggg 1920 
cgaggggctg taccagggcg tgcccagagc tgagccgggc accgaggccc ggagacacta 1980 
tgatgaaggt aaggccttgg cagccagcag aggctggtgt gggagccgcc caccagagac 2040 
gacactcggg gctgtgtctg ggctggtgcc tctccatcct ggccccgact tctctgtcag 2100 
gaaagtgggg atggacccca tctgcataca cggcttctca tgggtgtgga acatctctgc 2160 
ttgcggtttc aggaaggcct ctggctgctc taggagtctg atcagagtcg ttgccccagt 2220 
ttgacagaag gaaaggcgga gcttattcaa agtctagagg gagtggagga gttaaggctg 2280 
gatttcagat ctgcctggtt ccagccgcag tgtgccctct gctcccccaa cgactttcca 2340 
aataatctca ccagcgcctt ccagctcagg cgtcctagaa gcgtcttgaa gcctatggcc 2400 
agctgtcttiL qtgttccctc tcacccgcct gtcctcacag ctgagactcc caggaaacct 2460 
tcagactacc ttcctctgcc ttcagcaagg ggcgttgccc acattctctg agggtcagtg 2520 
gaagaaccta g3.:tcccatt gctagaggta gaaaggggaa gggtgctggg gagcagggct 2580 
ggtccacagc aggtctcgtg cagcaggtac ctgtggttcc gccttctcat ctccctgaga 2640 
ctgctccgac ccttccctcc caggctctgt ctgatggccc ctctccctct gcaggcgttc 2700 
ggatgggcag cct.ggggctg ttcctgcagt gcgccatctc cctggtcttc tctctggtca 2760 
tggaccggct qgtgcagcga ttcggcactc gagcagtcta tttggccagt gtggcagctt 2820 
tccctgtggc tgccggtgcc acatgcctgt cccacagtgt ggccgtggtg acagcttcag 2880 
ccgccctcac caggttcacc ttctcagccc tgcagatcct gccctacaca ctggcctccc 2940 
tctaccaccg ggagaagcag gtactcattg gccagtgggt ggagtcaggg tgggaggggt 3000 
ggtctgggtt titgggagqc caactagctc agaacctggt atctggcaag caactttgga 3060 
gaatgcttct ttgaatcaga gaagaagctt atcctagccc cagggccaga ggcttgggct 3120 
gcagaacagt ctagattaga ttctgggaat gacttcctgg ggtcaggact gtgtagcact 3180 
tgaatggatg attgcaggaa atgcaaaata cgatagtggg aatcccgaag ggtcaggcca 3240 
gcaggagccc taggcttcta ggctggttgt tctatggaga ggcagggcgc tgaatcagat 3300 
gacccctggg ccattcagcc tcagcagacg ggagtgggaa tggtccagcc ttagcaacac 3360 
ctttcttcag ggagcagcaa cctgacttag cctgtatcct actctggtct ctgagatggg 3420 
gcaggctcct tcctaccccc tttctttctg gcttattttt cttttctgtc taattccctt 3480 
gcatccctcc tttgcctcct tccctttctc cttccccttc cccttcccct 3540 
gtggcagatci tctgagcttg acacctgacc cactcacttg ggcactgtgt aagttgtggg 3600 
gacctccttc ttggttggcc ctacactaac cagcccctcc aggggcccct ttccttggga 3660 
agccacctaa cccaggtagt gtggtcatcc ttgtcccctc cactgacctc actgagctac 3720 
aaacctgggt gctggactct gccttgaggg gcatgaagtt ggggtgtccc aagggaggag 3780 
gagatgcagg actgctctca tagagctctc agactgtagg gaagacctgc ccctgcgtct 3840 
cgtagcactt gaggagagga gtaggtaagt tcgtagctga gaggctggtt aactgagtag 3900 
gtagctgcag gggtgagagg tatggagggg aggggctaag gttttggttg ggggagcctg 3960 
gtccctgaga cccctgttag cccactgata accttcttca gccttcactc ttctgcttgc 4020 
ctgggctggg ggcagggggc tggcatcagc ggccaggcct gagtatgtgc tgtcgtgcca 4080 
gggaacgttc tggggctagc catcttctcc agatggagga gcatgtctgt cctcggacca 4140 
ctccagactc caacctcagc ggacattcct ggggtggcag gcagggagga gaagtcctgg 4200 
gaggcccctt cctaacagca gctgatggca gacttggcac tgcacgctgt ctgcctgttc 4260 
ctttgcccac ttgttgagct gcatggtgag ccgtgggctt ccctggtgtc aggtttgagc 4 320 
tctgccatgg ctcccacctc gcaaatgcag ccaactcaac tcttctggca tggggacaat 4380 
gttggataag acctggcctt gtccttaaat aggaggctct gggccatcaa gggcaggggt 4 440 
tggggggatg gtggtcgacc agtcactctg atctaagtca gacagcagga aggaagtgag 4 500 
aagccttcaa cattagcaca gctggggctg ggggaggtgg gaagagggac attcctcctg 4 560 
cttggggtct actggattct ccctgcccca aggctgggga caagggagct catggcaggg 4 620 
cagctaccct agtggcatct gggaccccag agaggcagag cttctctgca ccgggcaatg 4 680 
aggatttcca gatgtcggag tggagggcag gcaggaagga aggttaggag agcctgcgtg 4 740 
gggtttgggc catcaggggc cctgccttgg cttttgttcc tctgttctgt gcatctctta 4800 
ccaccgtctt cattccccct gtgtcttttc cttaccttgg agctctgttc tctctgatct 4860 
gtgatattga gtttgtctgc ctcttacctg ttctaagagg ctagaggaga cctagacttc 4 920 
tgggttcaca tttgtccccg ccctaccccg ttacccttct cccactcctg aggaagggtc 4 980 
ctggttagac ttggaccaag tagggtctcc atcttctctc ctgctcctga ttctcatgaa 5040 
gtcccattgc ccctgggatg gaggcaaggg tctgttctca cagctggggt ggtgccagtg 5100 
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■ ; igt acac 
c a a t. a c c g a 
: . : ^ J ccct 
-C't ';:ctccca 
qqt qgtgggt 
ccccatcczg 
-■A' tat ccaq 
qqt cgccatt 
agcQ t agaaa 
cgctcctgt t 
■ jt qtggct c 
gggqcgt ccc 
gggt ttcagt 
actctgcagg 
ctagagaagg 
ccttaacctg 
c:ct:ccatggg 
c a c a a g a a c c 
acct tttatc 
taaatat tta 
ttggtgtcta 
*-.gqt cattgg 
cctaacccag 
raaggttagg 
accacccctc 
ctaggact gg 
ccccaacttt 
aaagtgcggt 
aatct cacac 

gggggggttt 

t tatactgta 
t atatgtt ta 



acctgt cctc 
ggggacact g 
aagcct ggac 
cct ccacccg 
gagcccaccg 
gatagtgcct 
ctcagccagt 
tacttt acta 
acttccagca 
agccccatgg 
tctgctgcca 
tctcct ctct 
ctggacttat 
tggattaccc 
gtttttggga 
cagcttcgtt 
atttgaacat 
aggt cccctc 
aggatgtggc 
acttatttat 
atatttgggt 
gctgat catt 
gaccttggaa 
gtgttgaagg 
ttctcttggc 
gctgatgaag 
ccccaccagc 
ttcccaagcc 
agaaactcag 
aagtgccgtt 
agtgagcaat 
aaaaaa 



tt cccctttt 
gaggtgctag 
ct ccct tccc 
cgctct gcgg 
aggccagggt 
t cctgc tgtc 
cr gtcactgc 
cacaggtagr 
cattggggtg 
ggctgccggg 
ccctgtgctg 
ccccagtctc 
acagggaggc 
aggctcaggg 
gctgaataaa 
taatgtagct 
atgaaagtta 
agcccacagc 
ctgttggt cc 
ttaacaaagt 
agggtggggg 
gccagaatct 
attctactca 
aaggt agagg 
ccagcctggt 
gcactgccca 
tccacaaccc 
tttgtccatc 
gagcaccccc 
tgcaataatg 
cagagtataa 



ct tcacccct 
cagtgaggac 
t aatggacac 
ggcct ctgcc 
ggttccgggc 
ccaggtggcc 
ct at atggtg 
atttgacaag 
gagggcctgc 
ctggccgcca 
ctgaggtgcg 
tagggctgcc 
cagaagggct 
ttaacagcta 
ct cagtcacc 
cttgcatggg 
tttgtagggg 
actgt ctttt 
ttctgttgcc 
agaagggaat 
at ccccaaca 
tcttctcctg 
t cccaaatga 
gtggggcttc 
tccccccact 
aaatttcccc 
tgtttggagc 
tcagccccca 
tgcctgagct 
tcgtcttatt 
tgtttatggt 



ctgccttagg 
agcctgatga 
gtgggtgctg 
tgtgatgtct 
cggggcatct 
ccatccctgt 
tct gccgcag 
agcgacttgg 
ctcactgggt 
gtttctgttg 
tagctgcaca 
tgactggagg 
ccatgcact g 
gcctcctagt 
tggtttccca 
agtttctagg 
aagagtcctg 
t gctgatcca 
at cacagaga 
ccattgctag 
atcaggtccc 
gggtctggcc 
taattccaaa 
aggtctcaac 
tccactcccc 
tacccccaac 
tactgcagga 
gagtatatct 
aagggaggtc 
tatttagcgg 
gacaaaatta 



tgttcctgcc 
ccagcttccr 
gaggcagtgq 
ccgtacgtgt 
gcctggacct 
t tatgggctc 
gcctgggt ct 
ccaaat actc 
cccagct ccc 
ctgccaaagt 
gctgggggct 
ccttccaagg 
gaatgcgggg 
tgagacacac 
tctctaagcc 
atgaaacact 
aggggcaaca 

CCCGCCtctt 

cacaggcatt 
cttttctgtg 
ctgagatagc 
ccccaaaatg 
tgctgt tacc 
ggcttcccta 
tctactctct 
tttcccctac 
ccagaagcac 
gtgcttgggg 
ttatct ctca 
ggtgaatatt 
aaggctttct 



5160 
5220 
5280 
5340 
5400 
54 60 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
6976 



<210> 627 
<211> 123 
•;212> PRT 

<2i3> Homo sapiens 
<400> 627 

Met: Gly Ser Leu Gly Leu Phe Leu Gin Cys Ala lie Ser Leu Val Phe 
5 10 15 



Ser Leu Val Met Asp Arg Leu Val 
20 

Tyr Leu Ala Ser Val Ala Ala Phe 

35 40 

Leu Ser His Ser Val Ala Val Val 
50 55 



Gin Arg Phe Gly Thr Arg Ala Val 
25 30 

Pro Val Ala Ala Gly Ala Thr Cys 
45 

Thr Ala Ser Ala Ala Leu Thr Gly 
60 



''he Thi Phe Ser Ala Leu Gin lie Leu Pro Tyr Thr Leu Ala Ser Leu 
65 70 75 80 

Tyr Wis Arg Glu Lys Gin Val Leu lie Gly Gin Trp Val Glu Ser Gly 
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Trp Glu Gly Trp Ser Gly Phe Leu Gly Gly Gin Leu Ala Gin Asn Leu 
100 105 110 



Val Ser Gly Lys Gin Leu Trp Arg Met Leu Leu 
115 120 



<210> 628 ' 

<211> 150 

<212> PRT 

<213> Homo sapiens 

<400> 628 

Met Val Gin Arg Leu Trp Val Ser Arg Leu Leu Arg His Arg Lys Ala 
5 10 15 

Gin Leu Leu Leu Val Asn Leu Leu Thr Phe Gly Leu Glu Val Cys Leu 
20 25 30 

Ala Ala Gly He Thr Tyr Val Pro Pro Leu Leu Leu Glu Val Gly Val 
35 40 45 

Glu Glu Lys Phe Met Thr Met Val Leu Gly Glu Ser Leu His Pro Pro 
50 55 60 

Ser Phe Leu Phe Gin He His Ala Thr Trp His Val Gly Gin Glu Tyr 

70 75 80 

Leu Cys Pro Gly Ser Cys Leu Glu Gly Glu Val Val Cys Trp Glu Gly 
B5 90 95 

He Ala Gly Gin Glu Gly Asp Pro Gly Leu Ara Glv His Thr Lvs Arn 
100 105 110 

Lys Lys Arg He Pro Arg Thr Tyr Pro Ser His Leu Trp He Pro Gly 
115 120 125 

Pro Ala Gin Ser Leu Ala His Arg Arg His Trp Arg Asn Ala Pro Asn 
130 135 140 

Leu Trp Leu Ala Leu Leu 
145 150 



<210> 629 
<211> 371 
<212> PRT 
<213> Homo sapiens 

<400> 629 

Met Leu Phe Pro Ser Phe Ser Arg Ser Leu Val Pro Leu Pro Leu Ala 
5 10 15 

Leu Tyr Leu Ser Gin Pro Leu Thr His Thr Thr Ser Leu Leu Ala Gly 
20 25 30 

He Gly Pro Val Leu Gly Leu Val Cys Val Pro Leu Leu Gly Ser Ala 
35 40 45 



BNSOOCtD < wo 0151633A2 L> 



w o 01/51633 



P( T/l S<>1/01574 



240 



Ser Asp His Trp Arg Gly Arg Tyr Giy Arg Arg Arg Pro Phe lie Trp 
50 ' 55 60 

Ala Leu Ser Leu Gly lie Leu Leu Ser Leu Phe Leu lie Pro Arg Ala 
65 70 75 80 

Gly Trp Leu Ala Gly Leu Leu Cys Pro Asp Pro Arg Pro Leu Glu Leu 
85 90 95 

Ala Leu Leu He Leu Gly Val Gly Leu Leu Asp Phe Cys Gly Gin Val 
100 105 110 

Cys Phe Thr Pro Leu Glu Ala Leu Leu Ser Asp Leu Phe Arg Asp Pro 
115 120 125 

Asp His Cys Arg Gin Ala Tyr Ser Val Tyr Ala Phe Met He Ser Leu 
130 135 140 

Gly Giy Cys Leu Gly Tyr Leu Leu Pro Ala He Asp Trp Asp Thr Ser 
145 150 155 160 

Ala Leu Ala Pro Tyr Leu Gly Thr Gin Glu Glu Cys Leu Phe Gly Leu 
165 170 175 

Leu Thr Leu He Phe Leu Thr Cys Val Ala Ala Thr Leu Leu Val Ala 
180 185 190 

Glu Glu Ala Ala Leu Gly Pro Thr Glu Pro Ala Glu Gly Leu Ser Ala 
195 200 205 

^ro Ser Leu Ser Pro His Cys Cys Pro Cys Arg Ala Arg Leu Ala Phe 
210 215 220 

Arg Asn Leu Gly Ala Leu Leu Pro Arg Leu His Gin Leu Cys Cys Arg 
225 230 235 240 

Met Pro Arg Thr Leu Arg Arg Leu Phe Val Ala Glu Leu Cys Ser Trp 
245 250 255 

Met Ala Leu Met Thr Phe Thr Leu Phe Tyr Thr Asp Phe Val Gly Glu 

260 265 270 

Gly Leu Tyr Gin Gly Val Pro Arg Ala Glu Pro Gly Thr Glu Ala Arg 
275 280 285 

Arg His Tyr Asp Glu Gly Lys Ala Leu Ala Ala Ser Arg Gly Trp Cys 
290 295 300 

Gly Ser Arg Pro Pro Glu Thr Thr Leu Gly Ala Val Ser Gly Leu Val 

305 310 315 320 

Pro Leu His Pro Gly Pro Asp Phe Ser Val Arg Lys Val Gly Met Asp 
325 330 335 

Pro He Cys He His Gly Phe Ser Trp Val Trp Asn He Ser Ala Cys 
340 345 350 
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Gly Phe Arg Lys Ala Ser Gly Cys Ser Arg Ser Leu lie Arg Val Val 
355 360 365 



Ala Pro Val 
370 



<210> 630 

<211> 2983 

<212> DMA 

<213> Homo sapiens 

<400> 630 

agagatagag tcttccctgg cattgcagga gagaatctga agggatgatg gatgcatcaa 60 
aagagctgca agttctccac attgacttct tgaatcagga caacgccgtt tctcaccaca 120 
catgggagtt ccaaacgagc agtcctgtgt tccggcgagg acaggtgttt cacctgcggc 180 
tggtgctgaa ccagccccta caatcctacc accaactgaa actggaattc agcacagggc 240 
cgaatcctag catcgccaaa cacaccctgg tggtgctcga cccgaggacg ccctcagacc 300 
actacaactg gcaggcaacc cttcaaaatg agtctggcaa agaggtcaca gtggctgtca 360 
ccagttcccc caatgccatc ctgggcaagt accaactaaa cgtgaaaact ggaaaccaca 420 
tccttaagtc tgaagaaaac atcctatacc ttctcttcaa cccatggtgt aaagaggaca 480 
tggttttcat gcctgatgag gacgagcgca aagagtacat cctcaatgac acgggctgcc 540 
attacgtggg ggctgccaga agtatcaaat gcaaaccctg gaactttggt cagtttgaga 600 
aaaatgtcct ggactgctgc atttccctgc tgactgagag ctccctcaag cccacagata 660 
ggagggaccc cgtgctggtg tgcagggcca tgtgtgctat gatgagcttt gagaaaggcc 720 
agggcgtgct cattgggaat tggactgggg actatgaagg tggcacagcc ccatacaagt 780 
ggacaggcag tgccccgatc ctgcagcagt actacaacac gaagcaggct gtgtgctttg 840 
gccagtgctg ggtgtttgct gggatcctga ctacagtgct gagagcgttg ggcatcccag 900 
cacgcagtgt gacaggcttc gattcagctc acgacacaga aaggaacctc acggtggaca 960 
cctatgtgaa tgagaatggc aagaaaatca ccagtatgac ccacgactct gtctggaatt 1020 
Lccatgtgtg gacggargcc tggatgaagc gaccggatct gcccaagggc tacgacggct 1080 
ggcaggctgt ggacgcaacg ccgcaggagc gaagccaggg tgtcttctgc tgtgggccat 1140 
caccactgac cgccatccgc aaaggtgaca tctttattgt ctatgacacc agattcgtct 1200 
tctcagaagt gaatggtgac aggctcatct ggttggtgaa gatggtgaat gggcaggagg 1260 
agttacacgt aatttcaatg gagaccacaa gcatcgggaa aaacatcagc accaaggcag 1320 
tgggccaaga caggcggaga gatatcacct atgagtacaa gtatccagaa ggctcctctg 1380 
aggagaggca ggtcatggat catgccttcc tccttctcag ttctgagagg gagcacagac 1440 
gacctgtaaa agagaacttt cttcacatgt cggtacaatc agatgatgtg ctgctgggaa 1500 
actctgttaa tttcaccgtg attcttaaaa ggaagaccgc tgccctacag aatgtcaaca 1560 
tcttgggctc ctttgaacta cagttgtaca ctggcaagaa gatggcaaaa ctgtgtgacc 1620 
tcaataagac ctcgcagatc caaggtcaag tatcagaagt gactctgacc ttggactcca 1680 
agacctacat caacagcctg gctatattag atgatgagcc agttatcaga ggtttcatca 174 0 
ttgcggaaat tgtggagtct aaggaaatca tggcctctga agtattcacg tctttccagt 1800 
accctgagtt ctctatagag ttgcctaaca caggcagaat tggccagcta cttgtctgca 1860 
attgtatctt caagaatacc ctggccatcc ctttgactga cgtcaagttc tctttggaaa 1920 
gcctgggcat ctcctcacta cagacctctg accatgggac ggtgcagcct ggtgagacca 1980 
tccaatccca aataaaatgc accccaataa aaactggacc caagaaattt atcgtcaagt 2040 
taagttccaa acaagtgaaa gagattaatg ctcagaagat tgttctcatc accaagtagc 2100 
cttgtctgat gctgtggagc cttagttgag atttcagcat ttcctacctt gtgcttagct 2160 
ttcagattat ggatgattaa atttgatgac ttatatgagg gcagattcaa gagccagcag 2220 
gtcaaaaagg ccaacacaac cataagcagc cagacccaca aggccaggtc ctgtgctatc 2280 
acagggtcac ctcttttaca gttagaaaca ccagccgagg ccacagaatc ccatcccttt 234 0 
cctgagtcat ggcctcaaaa atcagggcca ccattgtctc aattcaaatc catagatttc 2400 
gaagccacag agtctctccc tggagcagca gactatgggc agcccagtgc tgccacctgc 2460 
tgacgaccct tgagaagctg ccatatcttc aggccatggg ttcaccagcc ctgaaggcac 2520 
ctgtcaactg gagtgctctc tcagcactgg gatgggcctg atagaagtgc attctcctcc 2580 
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vcttgccrcc attctccrct ctctatccct ga 

caaqcagttt gaagcccaat ctgcaaggac at 

.^Hccctgtcc tcaggcctga actcaccata ga 

aatcctcttc ccttattcta aagctgcccc tt 

r.ctccctggt gtgaactctt tctttggtat tc 

cttctgttaa ctgaagcctg ctccttcttg tt 

::aataagctt taaattaaac tctacttcaa ga 



aatccagg aagtccctct cctggtgctc 2640 
ttctcaag ggccatgtgg ttttgcagac 27C0 
gacccatg tcagcaaacg gtgaccagca 2760 
gggagact ccagggagaa ggcattgctt 2820 
catccact atcctggcaa ctcaaggctg 2880 
ctgccctc cagagatttg ctcaaatgat 2940 
aaaaaaaa ccg 2983 



<210> 631 
'.211> 3064 
<212> DNA 

<213> Homo sapiens 



<4C0> 631 

aattctaaaa atgcttttgc aagcttgcat 
ggatatctgc agaattcggc ttgcgctcag 
ggcattgcag gagagaatct gaagggatga 
acattgactt cttgaatcag gacaacgccg 
gcagtcctgt gttccggcga ggacaggtgt 
tacaatccta ccaccaactg aaactggaat 
aacacaccct ggtggtgctc gacccgagga 
cccttcaaaa tgagtctggc aaagaggtca 
tcctgggcaa gtaccaacta aacgtgaaaa 
acatcctata ccttctcttc aacccatggt 
aggacgagcg caaagagtac atcctcaatg 
gaagtatcaa atgcaaaccc tggaactttg 
gcatttccct gctgactgag agctccctca 
tgtgcagggc catgtgtgct atgatgagct 
attggactgg ggactacgaa ggtggcacag 
tcctgcagca gtactacaac acgaagcagg 
ctgggatcct gactacagtg ctgagagcgt 
tcgattcagc tcacgacaca gaaaggaacc 
gcgagaaaat caccagtatg acccacgact 
cctggatgaa gcgaccctac gacggctggc 
gccagggtgt cttctgctgt gggccatcac 
ttattgtcta tgacaccaga ttcgtcttct 
tggtgaagat ggtgaatggg caggaggagt 
tcgggaaaaa catcagcacc aaggcagtgg 
agtacaagta tccagaaggc tcctctgagg 
ttctcagttc tgagagggag cacagacagc 
tacaatcaga tgatgtgctg ctgggaaact 
agaccgctgc cctacagaat gtcaacatct 
gcaagaagat ggcaaaactg tgtgacctca 
cagaagtgac tctgaccttg gactccaaga 
atgagccagt tatcagaggt ttcatcattg 
cctctgaagt attcacgtca aaccagtacc 
gcagaattgg ccagctactt gtctgcaatt 
tgactgacgt caagttctct ttggaaagcc 
atgggacggt gcagcctggt gagaccatcc 
ctggacccaa gaaatttatc gtcaagttaa 
agaagattgt tctcatcacc aagtagcctt 
tcagcatttc ctaccttgtg cttagctttc 
tatgagggca gattcaagag ccagcaggtc 
acccacaagg ccaggtcctg tgctatcaca 
gccgaggcca cagaatccca tccctttcct 
ttgtctcaat tcaaatccat agatttcgaa 
Lalqgycagc ccagtgctgc cacctgctga 
ccatgggttc accagccctg aaggcacctg 



gcctgcaggt gcagcggccg ccagtgtgat 60 
ctggaattcc gcagagatag agtcttccct 120 
tggatgcatc aaaagagctg caagttctcc 180 
tttctcacca cacatgggag ttccaaacga 240 
ttcacctgcg gctggtgctg aaccagcccc 300 
tcagcacagg gccgaatcct agcatcgcca 360 
cgccctcaga ccactacaac tggcaggcaa 420 
cagtggctgt caccagttcc cccaatgcca 480 
ctggaaacca catccttaag tctgaagaaa 540 
gtaaagagga catggttttc atgcctgatg 600 
acacgggctg ccattacgtg ggggctgcca 660 
gtcagtttga gaaaaatgtc ctggactgct 720 
agcccacaga taggagggac cccgtgctgg 780 
ttgagaaagg ccagggcgtg ctcattggga 840 
ccccatacaa gtggacaggc agtgccccga 900 
ctgtgtgctt tggccagtgc tgggtgtttg 960 
tgggcatccc agcacgcagt gtgacaggct 1020 
tcacggtgga cacctatgtg aatgagaatg 1080 
ctgtctggaa tttccatgtg tggacggatg 1140 
aggctgtgga cgcaacgccg caggagcgaa 1200 
cactgaccgc catccgcaaa ggtgacatct 1260 
cagaagtgaa tggtgacagg ctcatctggt 1320 
tacacgtaat ttcaatggag accacaagca 1380 
gccaagacag gcggagagat atcacctatg 1440 
agaggcaggt catggatcat gccttcctcc 1500 
ctgtaaaaga gaactttctt cacatgtcgg 1560 
ctgttaattt caccgtgatt cttaaaagga 1620 
tgggctcctt tgaactacag ttgtacactg 1680 
ataagacctc gcagatccaa ggtcaagtat 1740 
cctacatcaa cagcctggct atattagatg 1800 
cggaaattgt ggagtctaag gaaatcatgg 1860 
ctgagttctc tatagagttg cctaacacag 1920 
gtatcttcaa gaataccctg gccatccctt 1980 
tgggcatctc ctcactacag acctctgacc 2040 
aatcccaaat aaaatgcacc ccaataaaaa 2100 
gttccaaaca agtgaaagag attaatgctc 2160 
gtctgatgct gtggagcctt agttgagatt 2220 
agattatgga tgattaaatt tgatgactta 2280 
aaaaaggcca acacaaccat aagcagccag 2340 
gggtcacctc ttttacagtt agaaacacca 2400 
gagtcatggc ctcaaaaatc agggccacca 2460 
gccacagagc tcttccctgg agcagcagac 2520 
cgacccttga gaagctgcca tatcttcagg 2580 
tcaactggag tgctctctca gcactgggat 2640 



wo 01/51633 



243 



PCT/LJS01/()1574 



gggcctgata gaagtgcatt ctcctcctat tgcctccatt ctcctctctc tatccctgaa 2700 
atccaggaag tccctctcct ggtgctccaa gcagtttgaa gcccaatctg caaggacatt 2760 
tctcaagggc catgtggttt tgcagacaac cctgtcctca ggcctgaact caccatagag 2820 
acccatgtca gcaaacggtg accagcaaat cctcttccct tattctaaag ctgccccttg 2880 
ggagactcca gggagaaggc attgcttcct ccctggtgtg aactctttct ttggtattcc 2940 
atccactatc ctggcaactc aaggctgctt ctgttaactg aagcctgctc cttcttgttc 3000 
tgccctccag agatttgctc aaatgatcaa taagctttaa attaaaccgg aatccgcgga 3060 
^^^"^ 3064 

<210> 632 
<211> 684 
<212> PRT 

<213> Homo sapiens 
<400> 632 

Met Met Asp Ala Ser Lys Glu Leu Gin Val Leu His He Asp Phe Leu 
5 10 15 

Asn Gin Asp Asn Ala Val Ser His His Thr Trp Glu Phe Gin Thr Ser 
20 25 30 

Ser Pro Val Phe Arg Arg Gly Gin Val Phe His Leu Arg Leu Val Leu 
35 4 0 4 5 

Asn Gin Pro Leu Gin Ser Tyr His Gin Leu Lys Leu Glu Phe Ser Thr 
SO 55 60 

Gly Pro Asn Pro Ser He Ala Lys His Thr Leu Val Val Leu Asp Pro 
^S "70 75 80 

Arg Thr Pro Ser Asp His Tyr Asn Trp Gin Ala Thr Leu Gin Asn Glu 
85 90 95 

Ser Gly Lys Glu Val Thr Val Ala Val Thr Ser Ser Pro Asn Ala He 
100 105 110 

Leu Gly Lys Tyr Gin Leu Asn Val Lys Thr Gly Asn His He Leu Lys 

120 125 

Ser Glu Glu Asn He Leu Tyr Leu Leu Phe Asn Pro Trp Cys Lys Glu 
130 135 140 

Asp Met Val Phe Met Pro Asp Glu Asp Glu Arg Lys Glu Tyr He Leu 

150 155 160 

Asn Asp Thr Gly Cys His Tyr Val Gly Ala Ala Arg Ser He Lys Cys 
165 170 175 

Lys Pro Trp Asn Phe Gly Gin Phe Glu Lys Asn Val Leu Asp Cys Cys 
IBO 185 190 

He Ser Leu Leu Thr Glu Ser Ser Leu Lys Pro Thr Asp Arg Arg Asp 
195 200 205 

Pro Val Leu Val Cys Arg Ala Met Cys Ala Met Met Ser Phe Glu Lvs 
210 215 220 

Gly Gin Gly Val Leu lie Gly Asn Trp Thr Gly Asp Tyr Glu Gly Gly 
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225 230 235 240 

Thr Ala Pro Tyr Lys Trp Thr Giy Ser Ala Pro He Leu Gin Gin Tyr 
245 ^ 250 255 

Tyr Asn Thr Lys Gin Ala Val Cys Phe Gly Gin Cys Trp Val Phe Ala 
260 265 270 

Gly He Leu Thr Thr Val Leu Arg Ala Leu Gly He Pro Ala Arg Ser 
275 280 285 

Val Thr Gly Phe Asp Ser Ala His Asp Thr Glu Arg Asn Leu Thr Val 
290 295 300 

Asp Thr Tyr Val Asn Glu Asn Gly Lys Lys He Thr Ser Met Thr His 
305 310 315 320 

Asp Ser Val Trp Asn Phe His Val Trp Thr Asp Ala Trp Met Lys Arg 
325 330 335 

Pro Asp Leu Pro Lys Gly Tyr Asp Gly Trp Gin Ala Val Asp Ala Thr 
340 345 350 

Pro Gin Glu Arg Ser Gin Gly Val Phe Cys Cys Gly Pro Ser Pro Leu 
355 360 365 

Thr Ala He Arg Lys Gly Asp He Phe He Val Tyr Asp Thr Arg Phe 
370 375 380 

Val Phe Ser Glu Val Asn Gly Asp Arg Leu He Trp Leu Val Lys Met 
385 390 395 400 

\^al Asn Gly Gin Glu Glu Leu His Val He Ser Met Glu Thr Thr Ser 
405 410 415 

He Gly Lys Asn He Ser Thr Lys Ala Val Gly Gin Asp Arg Arg Arg 
420 425 430 

Asp He Thr Tyr Glu Tyr Lys Tyr Pro Glu Gly Ser Ser Glu Glu Arg 
435 440 445 

Gin Val Met Asp His Ala Phe Leu Leu Leu Ser Ser Glu Arg Glu His 
450 455 460 

Arg Arg Pro Val Lys Glu Asn Phe Leu His Met Ser Val Gin Ser Asp 
465 470 475 480 

Asp Val Leu Leu Gly Asn Ser Val Asn Phe Thr Val He Leu Lys Arg 
485 490 495 

Lys Thr Ala Ala Leu Gin Asn Val Asn He Leu Gly Ser Phe Glu Leu 
500 505 510 

:sin Leu Tyr Thr Gly Lys Lys Met Ala Lys Leu Cys Asp Leu Asn Lys 
515 520 525 

'TiL^ c--^- r'T-. -TT^ /-^T^ r--"!,, r^l Oi-i-*- r^Tn i;-,! TK-v- T^-mi TK-k- T£=in Zicri^ 

.! (1 r oer ij-Lii X-LC: vjJ_ii J~ y \J-Lii V oex vj-LLi V a -i- j-iiA. xj>^i-t j-iij- j^v--^ 

530 535 540 



wo 01/51633 PCT/lJSOl/01574 

245 



Ser Lys Thr Tyr 
545 

lie Arg Gly Phe 

Ala Ser Glu Val 
580 

Leu Pro Asn Thr 
595 

Phe Lys Asn Thr 
610 

Glu Ser Leu Gly 
625 

Gin Pro Gly Glu 



Thr Gly Pro Lys 
660 

Glu lie Asn Ala 
675 



lie Asn Ser Leu 
550 

lie lie Ala Glu 
565 

Phe Thr Ser Phe 



Gly Arg lie Gly 
600 

Leu Ala lie Pro 
615 

lie Ser Ser Leu 
630 

Thr He Gin Ser 
645 

Lys Phe He Val 



Gin Lys He Val 
680 



Ala He Leu Asp 

555 

He Val Glu Ser 
570 

Gin Tyr Pro Glu 
585 

Gin Leu Leu Val 



Leu Thr Asp Val 
620 

Gin Thr Ser Asp 

635 

Gin He Lys Cys 
650 

Lys Leu Ser Ser 
665 

Leu He Thr Lys 



Asp Glu Pro Val 
560 

Lys Glu He Met 
575 

Phe Ser He Glu 
590 

Cys Asn Cys He 
605 

Lys Phe Ser Leu 



His Gly Thr Val 
640 

Thr Pro He Lys 
655 

Lys Gin Val Lys 
670 



<210> 633 

<211> 679 

<212> PRT 

<213> Homo sapiens 

<400> 633 

Met Met Asp Ala Ser Lys Glu Leu Gin Val Leu His He Asp Phe Leu 
5 10 15 

Asn Gin Asp Asn Ala Val Ser His His Thr Trp Glu Phe Gin Thr Ser 
20 25 30 

Ser Pro Val Phe Arg Arg Gly Gin Val Phe His Leu Arg Leu Val Leu 
35 40 45 

Asn Gin Pro Leu Gin Ser Tyr His Gin Leu Lys Leu Glu Phe Ser Thr 
50 55 60 

Gly Pro Asn Pro Ser He Ala Lys His Thr Leu Val Val Leu Asp Pro 

70 75 80 

Arg Thr Pro Ser Asp His Tyr Asn Trp Gin Ala Thr Leu Gin Asn Glu 
85 90 95 

Ser Gly Lys Glu Val Thr Val Ala Val Thr Ser Ser Pro Asn Ala He 
100 105 110 

Leu Gly Lys Tyr Gin Leu Asn Val Lys Thr Gly Asn His He Leu Lys 
115 120 125 
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Ser Glu Glu Asn 
130 

Asp Met Val Phe 
145 

Asn Asp Thr Gly 



Lys Pro Trp Asn 
180 

lie Ser Leu Leu 
195 

Pro Val Leu Val 

210 

Gly Gin Gly Val 

225 

Thr Ala Pro Tyr 



Tyr Asn Thr Lys 
260 

Gly lie Leu Thr 
275 

Val Thr Gly Phe 
290 

Asp Thr Tyr Val 
305 

Asp Ser Val Trp 



Pro Tyr Asp Gly 
340 

Gin Gly Val Phe 
355 

Gly Asp lie Phe 
370 

Asn Gly Asp Arg 
385 

Glu Leu His Val 



Ser Thr Lys Ala 
420 

Tyr Lys Tyr Pro 



lie Leu 

Met Pro 
150 

Cys His 
165 

Phe Gly 

Thr Glu 

Cys Arg 

Leu lie 
230 

Lys Trp 
245 

Gin Ala 

Thr Val 

Asp Ser 

Asn Glu 
310 

Asn Phe 
325 

Trp Gin 

Cys Cys 

lie Val 

Leu lie 
390 

lie Ser 
405 

Val Gly 
Glu Gly 



Tyr Leu 
135 

Asp Glu 

Tyr Val 

Gin Phe 

Ser Ser 
200 

Ala Met 
215 

Gly Asn 
Thr Gly 
Val Cys 

Leu Arg 

280 

Ala His 
295 

Asn Gly 
His Val 
Ala Val 

Gly Pro 

360 

Tyr Asp 
375 

Trp Leu 
Met Glu 
Gin Asp 
Ser Ser 



246 

Leu Phe 

Asp Glu 

Gly Ala 
170 

Glu Lys 
185 

Leu Lys 
Cys Ala 
Trp Thr 

Ser Ala 

250 

Phe Gly 
265 

Ala Leu 
Asp Thr 
Glu Lys 

Trp Thr 

330 

Asp Ala 
345 

Ser Pro 
Thr Arg 
Val Lys 

Thr Thr 

410 

Arg Arg 
425 

Glu Glu 



Asn Pro 
140 

Arg Lys 
155 

Ala Arg 



Asn Val 



Pro Thr 



Met Met 
220 

Gly Asp 
235 

Pro lie 

Gin Cys 

Gly He 

Glu Arg 
300 

He Thr 
315 

Asp Ala 

Thr Pro 

Leu Thr 

Phe Val 
380 

Met Val 
395 

Ser He 
Arg Asp 



Arg Gin 



Trp Cys 

Glu Tyr 

Ser He 

Leu Asp 
190 

Asp Arg 

205 

Ser Phe 
Tyr Glu 
Leu Gin 

Trp Val 

270 

Pro Ala 
285 

Asn Leu 

Ser Met 

Trp Met 

Gin Glu 
350 

Ala He 
3 65 

Phe Ser 

Asn Gly 

Gly Lys 

He Thr 
430 

Val Met 



Lys Glu 

He Leu 
160 

Lys Cys 
175 

Cys Cys 

Arg Asp 

Glu Lys 

Gly Gly 
240 

Gin Tyr 

255 

Phe Ala 

Arg Ser 

Thr Val 

Thr His 
320 

Lys Arg 

335 

Arg Ser 

Arg Lys 

Glu Val 

Gin Glu 
400 

Asn He 
415 

Tyr Glu 
Asp His 
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435 

Ala Phe Leu Leu Leu 
450 

Glu Asn Phe Leu His 
465 

Asn Ser Val Asn Phe 
485 

Gin Asn Val Asn He 
500 

Lys Lys Met Ala Lys 
515 

Gly Gin Val Ser Glu 
530 



440 

Ser Ser Glu Arg Glu 
455 

Met Ser Val Gin Ser 
470 

Thr Val He Leu Lys 
490 

Leu Gly Ser Phe Glu 
505 

Leu Cys Asp Leu Asn 
520 

Val Thr Leu Thr Leu 
535 



445 

His Arg Gin Pro Val Lys 
460 

Asp Asp Val Leu Leu Gly 
475 480 

Arg Lys Thr Ala Ala Leu 
495 

Leu Gin Leu Tyr Thr Gly 
510 

Lys Thr Ser Gin He Gin 
525 

Asp Ser Lys Thr Tyr He 
540 



Asn Ser Leu Ala He Leu Asp Asp 
545 550 

He Ala Glu He Val Glu Ser Lys 
565 

Thr Ser Asn Gin Tyr Pro Glu Phe 
580 

Arg He Gly Gin Leu Leu Val Cys 
595 600 



Glu Pro Val He Arg Gly Phe He 
555 560 

Glu He Met Ala Ser Glu Val Phe 
570 575 

Ser He Glu Leu Pro Asn Thr Gly 
585 590 

Asn Cys He Phe Lys Asn Thr Leu 
605 



Ala lie Pro Leu Thr Asp Val Lys Phe Ser Leu Glu Ser Leu Gly He 
610 615 620 

Ser Ser Leu Gin Thr Ser Asp His Gly Thr Val Gin Pro Gly Glu Thr 
625 630 635 640 

He Gin Ser Gin He Lys Cys Thr Pro He Lys Thr Gly Pro Lys Lys 
645 650 655 

Phe He Val Lys Leu Ser Ser Lys Gin Val Lys Glu He Asn Ala Gin 
660 665 670 

Lys He Val Leu He Thr Lys 
675 



<210> 634 

<211> 5668 

<212> DNA 

<213> Homo sapiens 



<400> 634 

gtcacttagg aaaaggtgtc ctttcgggca 
gacactctgg acagcacccg gaccctgtac 
tacagtgaaa gcgacttggt gaattttatt 



gccgggctca gcatgaggaa cagaaggaat 60 
tccagcgcgt ctcggagcac agacttgtct 120 
caagcaaatt ttaagaaacg agaatgtgtc 180 
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"t.jtttacca aagattccaa ggccacggag 
^agcacargg aaggcaccca gatcaaccaa 
;;c c^at 1 1 c czaccgacgc ctttggggat 
_iaqtatatac gtctgtcctg cgacacggac 
ra-'-r qgcacc tgaaaacacc caacctggtc 
gcccrgaagc cgcgcatgcg caagatcttc 
latqcttgga ttctcacggg aggcacccat 
gtqagagata acaccatcag caggagttca 
gcttggggca tggtctccaa ccgggacacc 
•:tiitagccc agtaccttat ggatgacttc 
aaccaoacac atttgctgct cgtggacaat 
aagctccgqa atcagctaga gaagcatatc 
ggtggcaagd tccccattgt gtgttttgcc 
atcaatacct ccaticaaaaa taaaattcct 
gctgatgtga tcgctagcct ggtggaggtg 
gagaagctgg tgcqcttttt accccgcacg 
agttggatca aatqqctcaa agaaattctc 
atggaagaag ctggggotga aattgtgagc 
rtcagcacca gtgaqcaaqa caaggataac 
tggaaccagc tggacttaqc caatgatgag 
gctgaccttc aagaagtcat gtttacggct 
ctctttctgg agaatggctt gaacctacgg 
ctcttctcca accacttcag cacgcttgtg 
tataatgatg ccctcctcac gtttgtctgg 
cggaaggaag acagaaatgg ccgggacgag 
attactcggc accccctqca agctctcttc 
ctctccaaag tcatttggga gcagaccagg 
aagcttctga agactctggc caaagtgaag 
gagctggcta atgagtacga gacccgggct 
gatgaagact tggcagaaca gctgctggtc 
tgtctggagc tggcggtgga ggccacagac 
aattttcttt ctaagcaatg gtatggagag 
atcctgtgtc tgtttattat acccttggtg 
cctgtcgaca agcacaagaa gctgctttgg 
gtggtcttct cctggaatgt ggtcttctac 
ctgctcatgg atttccattc ggtgccacac 
tttgtcctct tctgtgatga agtgagacag 
gacctgtgga atgtgatgga cacgctgggg 
cggctccact cttctaataa aagctctttg 
tacattattt tcactctaag attgatccac 
aagattataa tgctgcagag gatgctgatc 
gtgtggatgg tggcctttgg cgtggccagg 
cggaggtgga tattccgttc ggtcatctac 
cccagtgacg tggatggtac cacgtatgac 
t.ccaagccac tgtgtgtgga gctggatgag 
accatccccc tggtgtgcat ctacatgtta 
gtcgccatgt ttggctacac ggtgggcacc 
ttccagaggt acttcctggt gcaggagtac 
atcgtcttcg cttacttcta catggtggtg 
aaaaacatgg agtcttctgt ctgctgtttc 
gagggtgtca tgaaggaaaa ctaccttgtc 
gaggaaatga ggcatcgatt tagacaactg 
r-tgaaagaga ttgctaataa aatcaaataa 
taattatagc aagatcatat taaggaatgc 
tgagagattt tcagacccct gggtacatgg 
^igaccttgag aataaagtgt gtgattggtt 
tatttccttt atgtgtttct ccagaatggt 
gactggaggt tgatagttta agtgtgttct 



aargtgtgca agtgtggcta tgcccagagc 240 
agtgagaaat ggaactacaa gaaacacacc 300 
attcagtcrg agacactggg gaagaaaggg 360 
gcqgaaatcc tttacgagct gctgacccag 420 
atttctgtga ccgggggcgc caagaacttc 480 
agccggctca tctacatcgc gcagtccaaa 540 
tatggcctga cgaagtacat cggggaggtg 600 
gaggagaata ttgtggccat tggcatagca 660 
ctcatcagga attgcgatgc tgagggctat 720 
acaagggatc cactgtatat cctggacaac 780 
ggctgtcatg gacatcccac tgtcgaagca 840 
tctgagcgca ctattcaaga ttccaactat 900 
caaggaggtg gaaaagagac tttgaaagcc 960 
tgtgtggtgg tggaaggctc gggccggatc 1020 
gaggatgccc cgacatcttc tgccgtcaag 1080 
gtgtcccggc tgtctgagga ggagactgag 1140 
gaatgttctc acctattaac agttattaaa 1200 
aatgccatct cctacgctct atacaaagcc 1260 
tggaatgggc agctgaagct tctgctggag 1320 
attttcacca atgaccgccg atgggagtct 1380 
ctcataaagg acagacccaa gtttgtccgc 1440 
aagtttctca cccatgatgt cctcactgaa 1500 
taccggaatc tgcagatcgc caagaattcc 1560 
aaactggttg cgaacttccg aagaggcttc 1620 
atggacatag aactccacga cgtgtctcct 1680 
atctgggcca ttcttcagaa taagaaggaa 1740 
ggctgcactc tggcagccct gggagccagc 1800 
aacgacatca atgctgctgg ggagtccgag 1860 
gttgagctgt tcactgagtg ttacagcagc 1920 
tattcctgtg aagcttgggg tggaagcaac 1980 
cagcatttca ccgcccagcc tggggtccag 2040 
atttcccgag acaccaagaa ctggaagatt 2100 
ggctgtggct ttgtatcatt taggaagaaa 2160 
tactatgtgg cgttcttcac ctcccccttc 2220 
atcgccttcc tcctgctgtt tgcctacgtg 2280 
ccccccgagc tggtcctgta ctcgctggtc 2340 
tggtacgtaa atggggtgaa ttattttact 2400 
cttttttact tcatagcagg aattgtattt 2460 
tattctggac gagtcatttt ctgtctggac 2520 
atttttactg taagcagaaa cttaggaccc 2580 
gatgtgttct tcttcctgtt cctctttgcg 2640 
caagggatcc ttaggcagaa tgagcagcgc 2700 
gagccctacc tggccatgtt cggccaggtg 2760 
tttgcccact gcaccttcac tgggaatgag 2820 
cacaacctgc cccggttccc cgagtggatc 2880 
tccaccaaca tcctgctggt caacctgctg 2940 
gtccaggaga acaatgacca ggtctggaag 3000 
tgcagccgcc tcaatatccc cttccccttc 3060 
aagaagtgct tcaagtgttg ctgcaaggag 3120 
aaaaatgaag acaatgagac tctggcatgg 3180 
aagatcaaca caaaagccaa cgacacctca 3240 
gatacaaagc ttaatgatct caagggtctt 3300 
aactgtatga aactctaatg gagaaaaatc 3360 
tgatgaacaa ttttgctatc gactactaaa 3420 
tggatgattt taaatcaccc tagtgtgctg 3480 
tcatacttga agacggatat aaaggaagaa 3540 
gcctgtttct ctctgtgtct caatgcctgg 3600 
taccgcctcc tttttccttt aatcttattt 3660 
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ttgatgaaca catatatagg agaacatcta 
ctcctgtatt gttattttgt tcatttccaa 
atgtgactaa ttagttggca tattgttaaa 
atgctgcagc aagaggaccc cgctctcttc 
tcttacctgt cagaggaggt gacaaggcag 
tattgaagga accaccccca ttcctaaata 
aaggcactac tgactttgtt cttattggat 
aataatagct ggctattata gaaaatttag 
tgtccccatt accttaaggt aatcactgct 
tttttttcta tgtatgtctc aattctcttt 
tatatacttt ttatgtaagc tttttcactt 
tcatagcctt cttaaacatt atatcaataa 
ataattttgc tcattgaagg ctatctccag 
gaatcctatt gctgtatttg ggaaaatttt 
attattaaat attaaaatat cgatttatta 
tgtatagcaa ataggaatta ttaacttgag 
attaaaataa aatattatat ttaaccctag 
tattatggat ggtgggcaga tcacttgagg 
ggcaaaacca catctctact aaaaataaaa 
taatcccagc tactcagaag gctgaggtac 
tgcagtgaac caagattgca ccactgcact 
tgaaaataaa taaataaata aataaataaa 
gaatggtata gaattggaga gattatctta 
aagtggtggt atttgagcag gatgtgcaca 
cagctttgaa tacactataa actcagtggc 
aaaagatcta atttgaaaaa ctacaaaagc 
ctgggcagta gtgaaaataa ctactcacaa 
aataaaagat gcctttttac ttaaacgcca 
aacttgcatt agagagggaa ctgttaaatg 
ttgcaggtta ctctgagaat tttgcttatg 
taatgtattg aacatacttc taatcaaagg 
gtcctgtgta cttttgcaca actgagaatc 
ciaataaataa tggaggaatt gtcaaaaaaa 
aaaaaaaaaa aaaaaaaaaa aaaaaaaa 



tcctatgaat aagaacctgg tcatgcttta 3720 
ttgattctct acttttccct tttttgtatt 3780 
agtctctcaa attaggccag attctaaaac 3840 
aggaaaagtg ttttcatttc tcaggatgct 3900 
tctcttgctc tcttggactc accaggctcc 3960 
tgtgaaaagt cgcccaaaat gcaaccttga 4020 
actcctctta tttattattt ttccattaaa 4080 
accatacaga gatgtagaaa gaacataaat 4140 
aacaatttct ggatggtttt tcaagtctat 4200 
caaaatttta cagaatgtta tcatactaca 4260 
agtattttat caaatatgtt tttattatat 4320 
ttgcataata ggcaacctct agcgattacc 4380 
ttgatcattg ggatgagcat ctttgtgcat 4440 
ccaaggttag attccaataa atatctattt 4500 
ttaaaaccat ttataaggct ttttcataaa 4 560 
cataagatat gagatacatg aacctgaact 4 620 
tttaagaaga agtcaatatg cttatttaaa 4 680 
tcaggagttc gagaccagcc tggccaacat 4740 
aaattagctg ggtgtggtgg tgcactcctg 4800 
aagaattgct ggaacctggg aggcggaggt 4 860 
ccagccgggg tgacagagtg agactccgac 4 920 
taaataaata aatattatgg atggtgaagg 4980 
ctgaacacct gtagtcccag ctttctctgg 5040 
aggcaattga aatgcccata attagtttct 5100 
tgaaggagga aattttagaa ggaagctact 5160 
attaactaaa aaagtttatt ttccttttgt 5220 
cattcactat gtttgcaagg aattaacaca 5280 
agacagaaaa cttgcccaat actgagaagc 534 0 
ttttcaaccc agttcatctg gtggatgttt 5400 
aaaaatcatt atttttagtg tagttcacaa 54 60 
tgctatgtcc ttgtgtatgg tactaaatgt 5520 
ctgcggcttg gtttaatgag tgtgttcatg 5580 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 5640 

5668 



<210> 635 

<211> 1095 

<212> PRT 

<213> Homo sapiens 



<400> 635 

Met Arg Asn Arg Arg Asn Asp Thr Leu Asp Ser Thr Arg Thr Leu Tyr 
5 10 15 

Ser Ser Ala Ser Arg Ser Thr Asp Leu Ser Tyr Ser Glu Ser Asp Leu 
20 25 30 

Val Asn Phe lie Gin Ala Asn Phe Lys Lys Arg Glu Cys Val Phe Phe 
35 40 45 

Thr Lys Asp Ser Lys Ala Thr Glu Asn Val Cys Lys Cys Gly Tyr Ala 
50 55 60 

Gin Ser Gin His Met Glu Gly Thr Gin lie Asn Gin Ser Glu Lys Trp 
65 70 75 80 



Asn Tyr Lys Lys His Thr Lys Glu Phe Pro Thr Asp Ala Phe Gly Asp 
85 90 95 
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lie Gin Phe Glu Thr Leu Gly Lys Lys Gly Lys Tyr lie Arg Leu Ser 
100 105 ^ ^ 110 

Cys Asp Thr Asp Ala Glu He Leu Tyr Glu Leu Leu Thr Gin His Trp 
115 120 125 

iiis Leu Lys Thr Pro Asn Leu Val He Ser Val Thr Gly Gly Ala Lys 
130 135 140 

Asn Phe Ala Leu Lys Pre Arg Met Arg Lys He Phe Ser Arg Leu He 
145 150 155 160 

Tyr He Ala Gin Ser Lys Gly Ala Trp He Leu Thr Gly Gly Thr Kis 
165 170 175 

Tyr Gly Leu Thr Lys Tyr He Gly Glu Val Val Arg Asp Asn Thr He 
180 185 190 

Ser Arg Ser Ser Glu Glu Asn He Val Ala He Gly He Ala Ala Trp 
195 200 205 

Gly Met Val Ser Asn Arg Asp Thr Leu He Arg Asn Cys Asp Ala Glu 
210 215 220 

Gly Tyr Phe Leu Ala Gin Tyr Leu Met Asp Asp Phe Thr Arg Asp Pro 
225 230 235 240 

Leu Tyr He Leu Asp Asn Asn His Thr His Leu Leu Leu Val Asp Asn 
245 250 255 

Gly Cys His Gly His Pro Thr Val Glu Ala Lys Leu Arg Asn Gin Leu 
260 265 270 

Glu Lys His He Ser Glu Arg Thr He Gin Asp Ser Asn Tyr Gly Gly 
275 280 285 

Lys He Pro He Val Cys Phe Ala Gin Gly Gly Gly Lys Glu Thr Leu 
290 295 300 

Lys Ala He Asn Thr Ser He Lys Asn Lys He Pro Cys Val Val Val 
305 310 315 320 

Glu Gly Ser Gly Arg He Ala Asp Val He Ala Ser Leu Val Glu Val 
325 330 335 

Glu Asp Ala Pro Thr Ser Ser Ala Val Lys Glu Lys Leu Val Arg Phe 
340 345 350 

Leu Pro Arg Thr Val Ser Arg Leu Ser Glu Glu Glu Thr Glu Ser Trp 
355 360 365 

He Lys Trp Leu Lys Glu He Leu Glu Cys Ser His Leu Leu Thr Val 
370 375 380 

He Lys Met Glu Glu Ala Gly Asp Glu He Val Ser Asn Ala He Ser 
385 390 ^95 400 
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Tyr Ala Leu Tyr Lys Ala Phe Ser Thr Ser Glu Gin Asp Lys Asp Asn 
405 410 415 

Trp Asn Gly Gin Leu Lys Leu Leu Leu Glu Trp Asn Gin Leu Asp Leu 
420 425 430 

Ala Asn Asp Glu lie Phe Thr Asn Asp Arg Arg Trp Glu Ser Ala Asp 
435 440 445 

Leu Gin Glu Val Met Phe Thr Ala Leu lie Lys: Asp Arg Pro Lys Phe 
450 455 460 

Val Arg Leu Phe Leu Glu Asn Gly Leu Asn Leu Arg Lys Phe Leu Thr 
465 470 475 480 

His Asp Val Leu Thr Glu Leu Phe Ser Asn His Phe Ser Thr Leu Val 
485 490 495 

Tyr Arg Asn Leu Gin lie Ala Lys Asn Ser Tyr Asn Asp Ala Leu Leu 
500 505 510 

Thr Phe Val Trp Lys Leu Val Ala Asn Phe Arg Arg Gly Phe Arg Lys 
515 520 525 

Glu Asp Arg Asn Gly Arg Asp Glu Met Asp lie Glu Leu His Asp Val 
530 535 540 

Ser Pro He Thr Arg His Pro Leu Gin Ala Leu Phe He Trp Ala He 
545 550 555 560 

Leu Gin Asn Lys Lys Glu Leu Ser Lys Val He Trp Glu Gin Thr Arg 
565 570 575 

Gly Cys Thr Leu Ala Ala Leu Gly Ala Ser Lys Leu Leu Lys Thr Leu 
580 585 590 

Ala Lys Val Lys Asn Asp He Asn Ala Ala Gly Glu Ser Glu Glu Leu 
595 600 605 

Ala Asn Glu Tyr Glu Thr Arg Ala Val Glu Leu Phe Thr Glu Cys Tyr 
610 615 620 

Ser Ser Asp Glu Asp Leu Ala Glu Gin Leu Leu Val Tyr Ser Cys Glu 
625 630 635 640 

Ala Trp Gly Gly Ser Asn Cys Leu Glu Leu Ala Val Glu Ala Thr Asp 
645 650 655 

Gin His Phe Thr Ala Gin Pro Gly Val Gin Asn Phe Leu Ser Lys Gin 
660 665 670 

Trp Tyr Gly Glu He Ser Arg Asp Thr Lys Asn Trp Lys He He Leu 
675 680 685 

Cys Leu Phe He He Pro Leu Val Gly Cys Gly Phe Val Ser Phe Arg 
690 695 700 

Lys Lys Pro Val Asp Lys His Lys Lys Leu Leu Trp Tyr Tyr Val Ala 
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705 



710 



715 



720 



Phe Phe Thr Ser Pro Phe Val Val Phe Ser Trp Asn Val Val Phe Tyr 
725 730 ^ 735 

lie Ala Phe Leu Leu Leu Phe Ala Tyr Val Leu Leu Met Asp Phe His 

740 745 750 

Ser Val Pro His Pro Pro Glu Leu Val Leu Tyr Ser Leu Val Phe Val 
755 760 765 

Leu Phe Cys Asp Glu Val Arg Gin Trp Tyr Val Asn Gly Val Asn Tyr 
770 775 ^ 780 

Phe Thr Asp Leu Trp Asn Val Met Asp Thr Leu Gly Leu Phe Tyr Phe 
785 790 795 800 

lie Ala Gly lie Val Phe Arg Leu His Ser Ser Asn Lys Ser Ser Leu 
805 810 815 

Tyr Ser Gly Arg Val He Phe Cys Leu Asp Tyr He He Phe Thr Leu 

820 825 830 

Arg Leu He His He Phe Thr Val Ser Arg Asn Leu Gly Pro Lys He 
835 840 845 

lie Met Leu Gin Arg Met Leu He Asp Val Phe Phe Phe Leu Phe Leu 
850 855 860 

Phe Ala Val Trp Met Val Ala Phe Gly Val Ala Arg Gin Gly He Leu 
865 870 875 880 

Arg Gin Asn Glu Gin Arg Trp Arg Trp He Phe Arg Ser Val He Tyr 
885 890 895 

Glu Pro Tyr Leu Ala Met Phe Gly Gin Val Pro Ser Asp Val Asp Gly 
900 905 910 

Thr Thr Tyr Asp Phe Ala His Cys Thr Phe Thr Gly Asn Glu Ser Lys 
915 920 925 

Pro Leu Cys Val Glu Leu Asp Glu His Asn Leu Pro Arg Phe Pro Glu 
930 935 940 

Trp He Thr He Pro Leu Val Cys He Tyr Met Leu Ser Thr Asn He 
945 950 955 960 

Leu Leu Val Asn Leu Leu Val Ala Met Phe Gly Tyr Thr Val Gly Thr 
965 970 975 

Val Gin Glu Asn Asn Asp Gin Val Trp Lys Phe Gin Arg Tyr Phe Leu 

980 985 990 

Val Gin Glu Tyr Cys Ser Arg Leu Asn He Pro Phe Pro Phe He Val 
995 1000 1005 



Phti Ala Tyr Phe Tyr Met Val Val Lys Lys Cys fhe Lys Cys Cys Cys 
1010 1015 1020 
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Lys Glu Lys Asn Met Glu Ser Ser Val Cys Cys Phe Lys Asn Glu Asp 
1025 1030 1035 1040 

Asn Glu Thr Leu Ala Trp Glu Gly Val Met Lys Glu Asn Tyr Leu Val 

1045 1050 1055 

Lys lie Asn Thr Lys Ala Asn Asp Thr Ser Glu Glu Met Arg His Arg 
1060 1065 1070 

Phe Arg Gin Leu Asp Thr Lys Leu Asn Asp Leu Lys Gly Leu Leu Lys 
1075 1080 1085 

Glu lie Ala Asn Lys lie Lys 
1090 1095 



<210> 636 

<211> 3639 

<212> DNA 

<213> Homo sapiens 



<400> 636 

gattacgcaa 

gagctcggat 

agcatgagga 

tctcggagca 

tttaagaaac 

aagtgtggct 

tggaactaca 

gagacactgg 

ctt tacgagc 

accgggggcg 

atctacatcg 

atgaagtaca 

attgtggcca 

aattgcgatg 

ccactgtata 

ggacatccca 

actattcaag 

ggaaaagaga 

gtggaaggct 

ctgacatctt 

ctgcctgagg 

cacctattaa 

tcctacgctc 

cagctgaagc 

aatgaccgcc 

gacagaccca 

acccatgatg 

ctgcagatcg 

gcgaacttcc 

gaactccacg 

attcttcaga 

ctggcagccc 

aatgctgctg 

ttcactgagt 



gctatttagg 
ccctagtaac 
acagaaggaa 
cagacttgtc 
gagaatgtgt 
atgcccagag 
agaaacacac 
ggaagaaagg 
tgctgaccca 
ccaagaactt 
cgcagtccaa 
tcggggaggt 
ttggcatagc 
ctgagggcta 
tcctggacaa 
ctgtcgaagc 
attccaacta 
ctttgaaagc 
cgggccagat 
ctgccgtcaa 
aggagactga 
cagttattaa 
tatacaaagc 
ttctgctgga 
gatgggagtc 
agtttgtccg 
tcctcactga 
ccaagaattc 
gaagaggctt 
acgtgtctcc 
ataagaagga 
tgggagccag 
gggagtccga 
gttacagcag 



tgacactata 
ggccgccagt 
tgacactctg 
ttacagtgaa 
cttctttacc 
ccagcacatg 
caaggaattt 
gaagtatata 
gcactggcac 
cgccctgaag 
aggtgcttgg 
ggtgagagat 
agcttggggc 
ttttttagcc 
caaccacaca 
aaagctccgg 
tggtggcaag 
catcaatacc 
cgctgatgtg 
ggagaagctg 
gagttggatc 
aatggaagaa 
cttcagcacc 
gtggaaccag 
tgctgacctt 
cctctttctg 
actcttctcc 
ctataatgat 
ccggaaggaa 
tattactcgg 
actctccaaa 
caagcttctg 
ggagctggct 
cgatgaagac 



gaatwctcag 
gtgctggaat 
gacagcaccc 
agcgacttgg 
aaagattcca 
gaaggcaccc 
cctaccgacg 
cgtctgtcct 
ctgaaaacac 
ccgcgcatgc 
attctcacgg 
aacaccatca 
atggtctcca 
cagtacctta 
catttgctgc 
aatcagctag 
atccccattg 
tccatcaaaa 
at cgct agcc 
gtgcgctttt 
aaatggctca 
gctggggatg 
agtgagcaag 
ctggacttag 
caagaagtca 
gagaatggct 
aaccacttca 
gccctcctca 
gacagaaatg 
caccccctgc 
gtcatttggg 
aagactctgg 
aatgagtacg 
ttggcagaac 



cttgcatcaa 
tcgcccttgc 
ggaccctgta 
tgaattttat 
aggccacgga 
agatcaacca 
cctttgggga 
gcgacacgga 
ccaacctggt 
gcaagatctt 
gaggcaccca 
gcaggagttc 
accgggacac 
tggatgactt 
tcgtggacaa 
agaagtatat 
tgtgttttgc 
ataaaattcc 
tggtggaggt 
taccccgcac 
aagaaattct 
aaattgtgag 
acaaggataa 
ccaatgatga 
tgtttacggc 
tgaacctacg 
gcacgcttgt 
cgtttgtctg 
gccgggacga 
aagctctctt 
agcagaccag 
ccaaagtgaa 
agacccgggc 
agctgctggt 



gcttggtacc 
agccgggctc 
ctccagcgcg 
tcaagcaaat 
gaatgtgtgc 
aagtgagaaa 
tattcagttt 
cgcggaaatc 
catttctgtg 
cagccggctc 
ttatggcctg 
agaggagaat 
cctcatcagg 
cacaagagat 
tggctgtcat 
ctctgagcgc 
ccaaggaggt 
ttgtgtggtg 
ggaggatgcc 
ggtgtcccgg 
cgaatgttct 
caatgccatc 
ctggaatggg 
gattttcacc 
tctcataaag 
gaagtttctc 
gtaccggaat 
gaaactggtt 
gatggacata 
catctgggcc 
gggctgcact 
gaacgacatc 
tgttgagctg 
ctattcctgt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 
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gaaqcttggg 
at cgcccagc 
":dc ic'caaga 
z\: r q tat cat 
gcgttcttca 
ctcctgctgt 
ntqqtcctgt 
aatggggtga 
ttcatagcag 
cqa gtcattt 
.ft. a. . -jcagaa 
ttct tcctgt 
cttaggcaga 
ctggccatgt 
tgcaccttca 
ccccggttcc 
at cctgctgg 
aacaatgacc 
ctcaatatcc 
1 1 caagtgtt 
gacaatgaga 
acaaaagcca 
ct taatgatc 
aactctaatg 
ttttgctatc 
taaatcaccc 
atccatcaca 



gtggaagcaa 
ct ggggt cca 
act ggaagat 
ttaggaagaa 
cctccccctt 
ttgcctacgt 
actcgctggt 
attattttac 
gaattgtatt 
tctgtctgga 
acttaggacc 
tcctctttgc 
atgagcagcg 
tcggccaggt 
ctgggaatga 
ccgagtggat 
tcaacctgct 
aggtctggaa 
ccttcccctt 
gctgcaagga 
ctctggcat g 
acgacacct c 
tcaagggtct 
gagaaaaatc 
gactact aaa 
tagtgtgct g 
ctggcggccg 



ctgtct ggag 
gaatrtt ctt 
t at cot gt gt 
acctgt cgac 
cgtggt cttc 
gctgct catg 
cttt gt cctc 
t gacctgt gg 
tcggctccac 
ctacattatt 
caagattata 
ggwgtggatg 
ctggaggtgg 
gcccagtgac 
gtccaagcca 
caccat cccc 
ggtcgccatg 
gttccagagg 
catcgtcttc 
gaaaaacatg 
ggsgggtgtc 
agaggaaatg 
tctgaaagag 
taattatagc 
tgagagattt 
agaccttgag 
ctcgagcatg 



ctggcggtgg 
ret aagcaa t 
ctgtttatta 
aagcacaaga 
tcctggaatg 
gattt ccatt 
ttctgtgatg 
aatgtgatgg 
tcttctaata 
ttcactctaa 
atgctgcaga 
gtggcctttg 
atattccgtt 
gtggatggta 
ctgtgtgtgg 
ctggt gtgca 
tttggctaca 
tacttcctgg 
gcttacttct 
gagtcttctg 
atgaaggaaa 
aggcatcgat 
attgctaata 
aagatcatat 
tcagacccct 
aataaagtgt 
catctagag 



aggccacaga 
ggtat ggaga 
tacccttggt 
agctgctttg 
tggtcttcta 
cggtgccaca 
aagtgagaca 
acacgctggg 
aaagctcttt 
gattgatcca 
ggatgctgat 
gcgtggccag 
cggtcatcta 
ccacgtatga 
agctggatga 
t ctacat gtt 
cggtgggcac 
tgcaggagta 
acatggtggt 
tctgctgttt 
actaccttgt 
ttagacaact 
aaatcaaata 
taaggaatgc 
gggtacatgg 
gaagggcgaa 



ccagcatttc 
gattt cccga 
gggctgtggc 
gtactatgtg 
cat cgccttc 
cccccccgag 
gtggtacgta 
gcttttttac 
gtattctgga 
catttttact 
cgatgt gttc 
gcaagggat c 
cgagccctac 
ctttgcccac 
gcacaacctg 
at ccaccaac 
cgtccaggag 
ctgcagccgc 
gaagaagtgc 
caaaaatgaa 
caagatcaac 
ggatacaaag 
aaactgtatg 
tgatgaacaa 
tggatgattt 
ttctgcagat 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3639 



<210> 637 
<211> 1095 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> VARIANT 

<222> (1) . . . (1095) 

<223> Xaa = Any Amino Acid 

<400> 637 

Met Arg Asn Arg Arg Asn Asp Thr Leu Asp Ser Thr Arg Thr Leu Tyr 
5 10 15 

Ser Ser Ala Ser Arg Ser Thr Asp Leu Ser Tyr Ser Glu Ser Asp Leu 
20 25 30 

Val Asn Phe lie Gin Ala Asn Phe Lys Lys Arg Glu Cys Val Phe Phe 
35 40 45 

Thr Lys Asp Ser Lys Ala Thr Glu Asn Val Cys Lys Cys Gly Tyr Ala 
50 55 60 

Gin Ser Gin His Met Glu Gly Thr Gin lie Asn Gin Ser Glu Lys Trp 

65 70 75 80 



Asn Tyr Lys Lys His Thr Lys Glu Phe Pro Thr Asp Ala Phe Gly Asp 
85 90 95 



lie Gin Phe Glu Thr Leu Gly Lys Lys Gly Lys Tyr lie Arg Leu Ser 
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100 105 110 

Cys Asp Thr Asp Ala Glu lie Leu Tyr Glu Leu Leu Thr Gin His Trp 
115 120 125 

His Leu Lys Thr Pro Asn Leu Val lie Ser Val Thr Gly Gly Ala Lys 
130 135 140 

Asn Phe Ala Leu Lys Pro Arg Met Arg Lys He Phe Ser Arg Leu He 
145 150 155 160 

Tyr He Ala Gin Ser Lys Gly Ala Trp He Leu Thr Gly Gly Thr His 
165 170 175 

Tyr Gly Leu Met Lys Tyr He Gly Glu Val Val Arg Asp Asn Thr He 
180 185 190 

Ser Arg Ser Ser Glu Glu Asn He Val Ala He Gly He Ala Ala Trp 
195 200 205 

Gly Met Val Ser Asn Arg Asp Thr Leu He Arg Asn Cys Asp Ala Glu 
210 215 220 

Gly Tyr Phe Leu Ala Gin Tyr Leu Met Asp Asp Phe Thr Arg Asp Pro 
225 230 235 240 

Leu Tyr He Leu Asp Asn Asn His Thr His Leu Leu Leu Val Asp Asn 
245 250 255 

Gly Cys His Gly His Pro Thr Val Glu Ala Lys Leu Arg Asn Gin Leu 
260 265 270 

^ - J- -j-^ ^^j^ wj-i^ ^-i- ^ iiij. vjj-xi >h.op oer /T-hiri lyx <j,iy ^xy 

275 280 285 

Lys He Pro He Val Cys Phe Ala Gin Gly Gly Gly Lys Glu Thr Leu 
290 295 300 

Lys Ala He Asn Thr Ser He Lys Asn Lys He Pro Cys Val Val Val 
305 310 315 320 

Glu Gly Ser Gly Gin He Ala Asp Val He Ala Ser Leu Val Glu Val 
325 330 335 

Glu Asp Ala Leu Thr Ser Ser Ala Val Lys Glu Lys Leu Val Arg Phe 
340 345 350 

Leu Pro Arg Thr Val Ser Arg Leu Pro Glu Glu Glu Thr Glu Ser Trp 
355 360 365 

He Lys Trp Leu Lys Glu He Leu Glu Cys Ser His Leu Leu Thr Val 
370 375 380 

He Lys Met Glu Glu Ala Gly Asp Glu He Val Ser Asn Ala He Ser 
385 390 395 400 

Tyr Ala Leu Tyr Lys Ala Phe Ser Thr Ser Glu Gin Asp Lys Asp Asn 
405 410 415 
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Trp Asn Giy Gin Leu Lys Leu Leu Leu Glu Trp Asn Gin Leu Asp Leu 
420 425 ^ 430 

Ala Asn Asp Glu lie Phe Thr Asn Asp Arg Arg Trp Glu Ser Ala Asp 
435 440 ^ ^ 445 

Leu Gin Glu Val Met Phe Thr Ala Leu lie Lys Asp Arg Pro Lys Phe 
450 455 ^ 460 

Val Arg I^eu Phe Leu Glu Asn Gly Leu Asn Leu Arg Lys Phe Leu Thr 

465 470 475 480 

His Asp Val Leu Thr Glu Leu Phe Ser Asn His Phe Ser Thr Leu Val 
485 490 495 

Tyr Arg Asn Leu Gin lie Ala Lys Asn Ser Tyr Asn Asp Ala Leu Leu 
500 505 510 

Thr Phe Val Trp Lys Leu Val Ala Asn Phe Arg Arg Gly Phe Arg Lys 
515 520 525 

Glu Asp Arg Asn Gly Arg Asp Glu Met Asp lie Glu Leu His Asp Val 
530 535 540 

Ser Pro lie Thr 7\rg His Pro Leu Gin Ala Leu Phe lie Trp Ala lie 
545 550 555 560 

Leu Gin Asn Lys Lys Glu Leu Ser Lys Val lie Trp Glu Gin Thr Arg 
565 570 575 

^^^^y Cys Thr Leu Ala Ala Leu Gly Ala Ser Lys Leu Leu Lys Thr Leu 
580 585 590 

Ala Lys Val Lys Asn Asp lie Asn Ala Ala Gly Glu Ser Glu Glu Leu 
595 600 605 

Ala Asn Glu Tyr Glu Thr Arg Ala Val Glu Leu Phe Thr Glu Cys Tyr 
610 615 620 

Ser Ser Asp Glu Asp Leu Ala Glu Gin Leu Leu Val Tyr Ser Cys Glu 
625 630 635 640 

Ala Trp Gly Gly Ser Asn Cys Leu Glu Leu Ala Val Glu Ala Thr Asp 
645 650 655 

Gin His Phe lie Ala Gin Pro Gly Val Gin Asn Phe Leu Ser Lys Gin 
660 665 670 

Trp Tyr Gly Glu lie Ser Axq Asp Thr Lys Asn Trp Lys lie lie Leu 
675 680 685 

Cys Leu Phe lie lie Pro Leu Val Gly Cys Gly Phe Val Ser Phe Arg 
690 695 700 

Lys Lys Pro Val Asp Lys Kis Lys Lys Leu Leu Trp Tyr Tyr Val Ala 
705 710 715 720 
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Phe Phe Thr Ser Pro Phe Val Val Phe Ser Trp Asn Val Val Phe Tyr 
'725 730 735 

lie Ala Phe Leu Leu Leu Phe Ala Tyr Val Leu Leu Met Asp Phe His 
740 745 750 

Ser Val Pro His Pro Pro Glu Leu Val Leu Tyr Ser Leu Val Phe Val 
755 760 765 

Leu Phe Cys Asp Glu Val Arg Gin Trp Tyr Val Asn Gly Val Asn Tyr 
770 775 780 

Phe Thr Asp Leu Trp Asn Val Met Asp Thr Leu Gly Leu Phe Tyr Phe 
785 790 795 800 

lie Ala Gly lie Val Phe Arg Leu His Ser Ser Asn Lys Ser Ser Leu 
805 810 . 815 

Tyr Ser Gly Arg Val lie Phe Cys Leu Asp Tyr lie He Phe Thr Leu 
820 825 830 

Arg Leu He His He Phe Thr Val Ser Arg Asn Leu Gly Pro Lys He 
835 840 845 

He Met Leu Gin Arg Met Leu He Asp Val Phe Phe Phe Leu Phe Leu 
850 855 860 

Phe Ala Xaa Trp Met Val Ala Phe Gly Val Ala Arg Gin Gly He Leu 
865 870 875 880 

Arg Gin Asn Glu Gin Arg Trp Arg Trp He Phe Arg Ser Val He Tyr 
885 890 895 

Glu Pro Tyr Leu Ala Met Phe Gly Gin Val Pro Ser Asp Val Asp Gly 
900 905 910 

Thr Thr Tyr Asp Phe Ala His Cys Thr Phe Thr Gly Asn Glu Ser Lys 
915 920 925 

Pro Leu Cys Val Glu Leu Asp Glu His Asn Leu Pro Arg Phe Pro Glu 
930 935 940 

Trp He Thr He Pro Leu Val Cys He Tyr Met Leu Ser Thr Asn He 
945 950 955 960 

Leu Leu Val Asn Leu Leu Val Ala Met Phe Gly Tyr Thr Val Gly Thr 
965 970 975 

Val Gin Glu Asn Asn Asp Gin Val Trp Lys Phe Gin Arg Tyr Phe Leu 
980 985 990 

Val Gin Glu Tyr Cys Ser Arg Leu Asn He Pro Phe Pro Phe He Val 
995 1000 1005 

Phe Ala Tyr Phe Tyr Met Val Val Lys Lys Cys Phe Lys Cys Cys Cys 
1010 1015 1020 

Lys Glu Lys Asn Met Glu Ser Ser Val Cys Cys Phe Lys Asn Glu Asp 
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102b 103C 1035 1040 

Asn G:u Thr Leu Ala Trp Glu Gly Val Met Lys Glu Asn Tyr Leu Val 
1045 1050 1055 

Ly- Tic- Asn Thr Lys Ala Asn Asp Thr Ser Glu Glu iMet Arg His Arg 
1060 1065 1070 

Phe Arg Gin Leu Asp Thr Lys Leu Asn Asp Leu Lys Gly Leu Leu Lys 
1075 1080 1085 

Glu lie Ala Asn Lys lie Lys 
1090 1095 



<210> 638 
<2:il> 15 
<212> PRT 

<213> Homo sapiens 
<400> 638 

Arg Met Pro Thr Val Leu Gin Cys Val Asn Val Ser Val Val Ser 
5 10 15 

<210> 639 
<211> 45 
<212> DNA 

<213> Homo sapiens 
<400> 639 

agaatgccta ccgtgctgca gtgcgtgaac gtgtcggtgg tgtct 45 



<210> 640 
<211> 45 
<212> DNA 

<213> Homo sapiens 
<400> 640 

gagccaggga gccagatggt ggaggccagc ctctccgtac ggcac 45 

<210> 641 
<211> 45 
<"2I2> DNA 

<213> Homo sapiens 
<400> 641 

gaggccgacc aagagccagg gagccagatg gtggaggcca gcctc 45 

<210> 642 
<211> 45 

'':^^12> DNA 

<213> Homo sapiens 
<400> 642 

gqr^ctgcaca gtcttgaggc cgaccaagag ccagggagcc agatg 45 
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<210> 643 

<211> 45 

<212> DNA 

<213> Homo sapiens 

<400> 643 

tacaccatcg ggctgggcct gcacagtctt gaggccgacc aagag 

<210> 644 

<211> 42 

<212> DNA 

<213> Homo sapiens 

<400> 644 

ttccagaact cctacaccat cgggctgggc ctgcacagtc tt 

<210> 645 

<211> 45 

<212> DNA 

<213> Homo sapiens 

<400> 645 

ctgtcagccg cacactgttt ccagaactcc tacaccatcg ggctg 

<210> 64 6 

<211> 45 

<212> DNA 

<213> Homo sapiens 

<400> 646 

catccgcagt gggtgctgtc agccgcacac tgtttccaga actcc 

<2]0> 647 

<211> 45 

<212> DNA 

<213> Homo sapiens 

<400> 647 

tcgggcgtcc tggtgcatcc gcagtgggtg ctgtcagccg cacac 

<210> 648 

<211> 45 

<212> DNA 

<213> Homo sapiens 

<400> 648 

aacgaattgt tctgctcggg cgtcctggtg catccgcagt gggtg 

<210> 649 

<211> 45 

<212> DNA 

<213> Homo sapiens 

<400> 649 

gcactggtca tggaaaacga attgttctgc tcgggcgtcc tggtg 

<210> 650 
<211> 51 



45 



42 



45 



45 



45 



45 



45 
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2 1 3 > H C' IT , o s 5 c i e s 

tcQcaaccct ggcaggcggc actggtcatg gaaaacgaat tgttctgctc g 51 

<:nO> 651 
<2: 2> DNA 

<2J 3> Homo sapiens 
'.^OO 651 

atcagcatra ct*_cgcagtg ccctaccgcg gggaactctt gcctc 45 

<210> 652 
<211> 45 
<212> DNA 

<213> Homo s^z\^x\s 
<400> 652 

tccgtgtccg acitctgacac catccggagc atcagcattg cttcg 45 

■^210> 653 
<211> 45 
<2:2> DNA 

<213> Horao sapiens 
<400> 653 

atcaagttgg acgaatccgt gtccgagtct gacaccatcc ggagc 45 

<210> 654 
<211> 45 
--'212> DNA 

<2.13> Homo sapiens 
<400> 654 

aacgacctca tgctcatcaa gttggacgaa tccgtgtccg agtct 45 

<210> 655 

<211> 45 

<212> DNA 

<213> Homo sapiens 

<400> 655 

cigacccttgc tcgctaacga cctcatgctc atcaagttgg acgaa 45 

<210> 656 
<211> 15 
<212> PRT 

<213> Homo sapiens 
<400> 656 

Giu Pro Gly Ser Gin Met Val Glu Ala Ser Leu Ser Val Arg His 
5 10 15 

<210> 657 
<21I> 15 
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<212> PRT 

<213> Homo sapiens 
<400> 657 

Glu Ala Asp Gin Glu Pro Gly Ser Gin Met Val Glu Ala Sex Leu 

5 10 15 



<210> 658 

<211> 15 

<212> PRT 

<213> Homo sapiens 

<400> 658 

Gly Leu His Ser Leu Glu Ala Asp Gin Glu Pro Gly Ser Gin Met 
5 10 15 



<210> 659 
<211> 15 
<212> PRT 

<213> Homo sapiens 
<400> 659 

Tyr Thr lie Gly Leu Gly Leu His Ser Leu Glu Ala Asp Gin Glu 
5 10 15 



<210> 660 
<211> 14 
<212> PRT 

<213> Homo sapiens 
<400> 660 

Phe Gin Asn Ser Tyr Thr lie Gly Leu Gly Leu His Ser Leu 
5 10 



<210> 661 
<211> 15 
<212> PRT 

<213> Homo sapiens 
<400> 661 

Leu Ser Ala Ala His Cys Phe Gin Asn Ser Tyr Thr lie Gly Leu 
5 10 15 



<210> 662 
<211> 15 
<212> PRT 

<213> Homo sapiens 
<400> 662 

His Pro Gin Trp Val Leu Ser Ala Ala His Cys Phe Gin Asn Ser 
5 10 15 
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<210> 663 
<211> 15 
<.21Z> PRT 

<213> Homo sapiens 
<40C> 663 

Ser Gly Val Leu Val His Pro Gin Trp Val Leu Ser Ala Ala His 
5 10 15 



<21C> 664 
<211> 15 
<212> PRT 

<213> Homo sapiens 
<400> 664 

Asn Glu Leu Phe Cys Ser Gly Val Leu Val His Pro Gin Trp Val 
5 10 15 



<210> 665 
<211> 15 
<212> PRT 

<213> Hoino sapiens 
<400> 665 

Ala Leu Val Met Glu Asn Glu Leu Phe Cys Ser Gly Val Leu Val 
5 10 15 



<210> 666 

<211> 17 

<212> PRT 

<213> Homo sapiens 

<400> 666 

Ser Gin Pro Trp Gin Ala Ala Leu Val Met Glu Asn Glu Leu Phe Cys 
5 10 15 

Ser 



<210> 667 
<212> PRT 

<213> Homo sapiens 
<400> 667 

lie Ser lie Ala Ser Gin Cys Pro Thr Ala Gly Asn Ser Cys Leu 
5 10 15 



<210> 668 

<211> 15 

<212> PRT 

<213> Homo saoiens 



wo 01/51633 



263 



PCT/ljSOl/01574 



<400> 668 

Ser Val Ser Glu Ser Asp Thr lie Arg Ser He Ser He Ala Ser 
5 10 15 

<210> 669 

<211> 15 

<212> PRT 

<213> Homo sapiens 

<400> 669 

He Lys Leu Asp Glu Ser Val Ser Glu Ser Asp Thr He Arg Ser 
5 10 15 



<210> 670 

<211> 15 

<212> PRT 

<213> Homo sapiens 

<400> 670 

Asn Asp Leu Met Leu He Lys Leu Asp Glu Ser Val Ser Glu Ser 

5 10 15 

<210> 671 

<211> 15 

<212> PRT 

<213> Homo sapiens 

<400> 671 

Arg Pro Leu Leu Ala Asn Asp Leu Met Leu He Lys Leu Asp Glu 

<210> 672 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
<400> 672 

ggaccagcat atgaggaaca gaaggaatga cactc 35 

<210> 673 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
<400> 673 

ccgctcgagt ccaccccaag cttcacagg 29 
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<._..0> 67 4 
<::il> 1959 

- :::: ■ dna 

2 1 H omo sapiens 

--.4 00> 67 4 

_:rqagqaaca gaaggaatga cactctggac agcacccgqa ccctgtactc cagcgcgtct 60 

cqgagcacag acttgtctta cagtgaaagc gacttggtga attttattca agcaaatttt 120 

aagaaacgag aatgtgtctt ctttaccaaa gattccaagg ccacggagaa tgtgtgcaag 180 

tgtggctatg cccagagcca gcacatggaa ggcacccaga tcaaccaaag tgagaaatgg 240 

■in.nacaaga aacacaccaa ggaatttcct accgacgcct ttggggatat tcagtttgag 300 

acactgggga agaaagggaa gtatatacgt ctgtcctgcg acacggacgc ggaaatcctt 360 

tacgagctgc tgacccagca ctggcacctg aaaacaccca acctggtcat ttctgtgacc 420 

qqqqgcgcca agaacttcgc cctgaagccg cgcatgcgca agatcttcag ccggctcatc 480 

tacatcgcgc agtccaaagg tgcttggatt ctcacgggag gcacccatta tggcctgatg 540 

aagtacatcg gggaggtggt gagagataac accatcagca ggagttcaga ggagaatatt 600 

gtggccattg gcatagcagc ttggggcatg qtctccaacc gggacaccct catcaggaat 660 

t.qcgatgctg agggctattt tttagcccag taccttatgg atgacttcac aagagatcca 720 

c.toratatcc tggacaacaa ccacacacat ttgctgctcg tggacaatgg ctgtcatgga 780 

catcccactg tcgaagcaaa gctccggaat cagctagaga agtatatctc tgagcgcact 840 

attcaagatt ccaactatgg tggcaagatc cccattgtgt gttttgccca aggaggtgga 900 

aaagagactt tgaaagccat caatacctcc atcaaaaata aaattccttg tgtggtggtg 960 

gaaqgctcgg gccagatcgc tgatgtgatc gctagcctgg tggaggtgga ggatgccctg 1020 

acatcttctg ccgtcaagga gaagctggtg cgctttttac cccgcacggt gtcccggctg 1080 

cctgaggagg agactgagag ttggatcaaa tggctcaaag aaattctcga atgttctcac 1140 

ctattaacag ttattaaaat ggaagaagct ggggatgaaa ttgtgagcaa tgccatctcc 1200 

tacgctctat acaaagcctt cagcaccagt gagcaagaca aggataactg gaatgggcag 1260 

ctgaagcttc tgctggagtg gaaccagctg gacttagcca atgatgagat tttcaccaat 1320 

gaccgccgat gggagtctgc tgaccttcaa gaagtcatgt ttacggctct cataaaggac 1380 

agacccaagt ttgtccgcct ctttctggag aatggcttga acctacggaa gtttctcacc 1440 

catgatgtcc tcactgaact cttctccaac cacttcagca cgcttgtgta ccggaatctg 1500 

r.agatcgcca agaattccta taatgatgcc ctcctcacgt ttgtctggaa actggttgcg 1560 

-uicttccgaa gaggcttccg gaaggaagac agaaatggcc gggacgagat ggacatagaa 1620 

v^tccacgacg tgtctcctat tactcggcac cccctgcaag ctctcttcat ctgggccatt 1680 

cttcagaata agaaggaact ctccaaagtc atttgggagc agaccagggg ctgcactctg 1740 

qcaqccctgg gagccagcaa gcttctgaaq actctggcca aagtgaagaa cgacatcaat 1800 

qctgctgggg agtccgagga gctggctaat gagtacgaga cccgggctgt tgagctgttc 1860 

actgagtgtt acagcagcga tgaagacttg gcagaacagc tgctggtcta ttcctgtgaa 1920 

qcrtggggtg gactcgagca ccaccaccac caccactga 1959 

<210> 675 
<23 1> 652 
<212> PRT 

<213> fiomo sapiens 
<400> 675 

Met Arg Asn Arg Arg Asn Asp Thr Leu Asp Ser Thr Arg Thr Leu Tyr 
5 10 15 

Ser Ser Ala Ser Arg Ser Thr Asp Leu Ser Tyr Ser Glu Sor Asp Leu 
20 25 30 

Asn Phe lie Gin Ala Asn Phe Lys Lys Arg Glu Cys Val Phe Phe 
35 40 45 

Thr Lys Asp Ser Lys Ala Thr Glu Asn Val Cys Lys Cys Gly Tyr Ala 
5 0 55 60 
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Gin Ser Gin His Met Glu Gly Thr Gin He Asn Gin Ser Glu Lys Trp 
^5 70 75 80 

Asn Tyr Lys Lys His Thr Lys Glu Phe Pro Thr Asp Ala Phe Gly Asp 
85 90 95 

He Gin Phe Glu Thr Leu Gly Lys Lys Gly Lys Tyr lie Arg Leu Ser 
100 105 110 

Cys Asp Thr Asp Ala Glu He Leu Tyr Glu Leu Leu Thr Gin His Trp 
115 120 125 

His Leu Lys Thr Pro Asn Leu Val He Ser Val Thr Gly Gly Ala Lys 
130 135 140 

Asn Phe Ala Leu Lys Pro Arg Met Arg Lys He Phe Ser Arg Leu He 
145 150 155 160 

Tyr He Ala Gin Ser Lys Gly Ala Trp He Leu Thr Gly Gly Thr His 
165 170 175 

Tyr Gly Leu Met Lys Tyr He Gly Glu Val Val Arg Asp Asn Thr He 
180 185 190 

Ser Arg Ser Ser Glu Glu Asn He Val- Ala He Gly He Ala Ala Trp 
195 200 205 

Gly Met Val Ser Asn Arg Asp Thr Leu He Arg Asn Cys Asp Ala Glu 
210 215 220 

Gly Tyr Phe Leu Ala Gin Tyr Leu Met Asp Asp Phe Thr Arg Asp Pro 
225 230 235 240 

Leu Tyr He Leu Asp Asn Asn His Thr His Leu Leu Leu Val Asp Asn 
245 250 255 

Gly Cys His Gly His Pro Thr Val Glu Ala Lys Leu Arg Asn Gin Leu 
260 265 270 

Glu Lys Tyr He Ser Glu Arg Thr He Gin Asp Ser Asn Tyr Gly Gly 
275 280 285 

Lys He Pro He Val Cys Phe Ala Gin Gly Gly Gly Lys Glu Thr Leu 
290 295 300 

Lys Ala He Asn Thr Ser He Lys Asn Lys He Pro Cys Val Val Val 
305 310 315 320 

Glu Gly Ser Gly Gin He Ala Asp Val He Ala Ser Leu Val Glu Val 
325 330 335 

Glu Asp Ala Leu Thr Ser Ser Ala Val Lys Glu Lys Leu Val Arg Phe 
340 345 350 

Leu Pro Arg Thr Val Ser Arg Leu Pro Glu Glu Glu Thr Glu Ser Trp 
355 360 365 

He Lys Trp Leu Lys Glu He Leu Glu Cys Ser His Leu Leu Thr Val 
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370 375 380 

lie Lys Met Glu Glu Ala Gly Asp Giu He Val Ser Asn Ala He Ser 
385 390 395 4C0 

Tyr Ala Leu Tyr Lys Ala Phe Ser Thr Ser Glu Gin Asp Lys A.sp Asn 
405 410 415 

Trp Asn Gly Gin Leu Lys Leu Leu Leu Giu Trp Asn Gin Leu Asp Leu 
420 425 430 

Ala Asn Asp Glu lie Phe Thr Asn Asp Arg Arg Trp Glu Ser Ala Asp 
435 440 445 

Leu Gin Glu Val Met Phe Thr Ala Leu He Lys Asp Arg Pro Lys Phe 
450 455 460 

Val Arg Leu Phe Leu Glu Asn Gly Leu Asn Leu Arg Lys Phe Leu Thr 
465 470 475 480 

His Asp Val Leu Thr Glu Leu Phe Ser Asn His Phe Ser Thr Leu Val 
485 490 495 

Tyr Arg Asn Leu Gin He Ala Lys Asn Ser Tyr Asn Asp Ala Leu Leu 
500 505 510 

Thr Phe Val Trp Lys Leu Val Ala Asn Phe Arg Arg Gly Phe Arg Lys 
515 520 525 

Glu Asp Arg Asn Gly Arg Asp Glu Met Asp He Glu Leu His Asp Val 
530 535 540 

Ser Pro He Thr Arg His Pro Leu Gin Ala Leu Phe He Trp Ala He 
545 550 555 560 

Leu Gin Asn Lys Lys Glu Leu Ser Lys Val He Trp Glu Gin Thr Arg 
565 570 575 

Gly Cys Thr Leu Ala Ala Leu Gly Ala Ser Lys Leu Leu Lys Thr Leu 
580 585 590 

Ala Lys Val Lys Asn Asp He Asn Ala Ala Gly Glu Ser Glu Glu Leu 
595 600 605 

Ala Asn Glu Tyr Glu Thr Arg Ala Val Glu Leu Phe Thr Glu Cys Tyr 
610 615 620 

Ser Ser Asp Glu Asp Leu Ala Glu Gin Leu Leu Val Tyr Ser Cys Glu 
625 630 635 640 

Ala Trp Gly Gly Leu Glu His His His His His His 
645 650 



<210> 676 
<211> 132 
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<212> PRT 

<213> Homo sapien 

<400> 676 

Thr Ala Ala Ser Asp Asn Phe Gin Leu Ser Gin Gly Gly Gin Gly Phe 

15 10 15 

Ala He Pro He Gly Gin Ala Met Ala He Ala Gly Gin He Arq Ser 

20 25 30 

Gly Gly Gly Ser Pro Thr Val His He Gly Pro Thr Ala Phe Leu Gly 

35 40 45 

Leu Gly Val Val Asp Asn Asn Gly Asn Gly Ala Arg Val Gin Arg Val 

50 55 60 

Val Gly Ser Ala Pro Ala Ala Ser Leu Gly He Ser Thr Gly Asp Val 

70 75 80 

He Thr Ala Val Asp Gly Ala Pro He Asn Ser Ala Thr Ala Met Ala 

85 90 95 

Asp Ala Leu Asn Gly His His Pro Gly Asp Val He Ser Val Asn Trp 

100 105 110 

Gin Thr Lys Ser Gly Gly Thr Arg Thr Gly Asn Val Thr Leu Ala Glu 

115 120 125 

Gly Pro Pro Ala 
130 



<210> 677 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 



<400> 677 

ggggaattca tgatccggga gaaatttgcc cactgc 36 

<210> 678 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
<400> 678 

gggctcgagt caggagtttg agaccagcct ggc 33 



<210> 679 
<211> 675 
<212> DNA 

<213> Homo sapiens 
<400> 679 

atgcatcacc atcaccatca cacggccgcg tccgataact tccagctgtc ccagggtggg 60 
cagggattcg ccattccgat cgggcaggcg atggcgatcg cgggccagat caagcttccc 120 
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,-iC^;gtt cata 
ggcgcacgag 
.5 a :: qacg r ga 
jcgcttaacg 
ggcacgcgta 
gagaaatttg 
?j.qcqacaaga 
L Lyctgcaaa 
gccgctgccc 
gtctcaaact 



tcgggcctac 
tccaacgcgt 
r caccgcggt 
gqcatcatcc 
cagggaacgt 
cccactgcac 
taatggtttt 
at aaagagag 
rcactgaaac 
cctga 



cgccttcctc 
ggt cgggagc 
cgacggcgct 
cggt gacgtc 
gacattggcc 
cgtgctaacc 
agattcagga 
cctattttac 
agcaaaacag 



ggcttgggtg 
gctccggcgg 
ccgat caact 
arct cggtga 
gagggacccc 
attgcacaca 
agactgaaag 
aagatggtgc 
agatggggtt 



tt gt cgacaa 
caagtc L egg 
cggccaccgc 
cctggcaaac 
cggccgaatt 
gattgaacac 
aatatgatga 
aacaactggg 
tcaccatgtt 



caacggcaac 
ca tct ccacc 
gatggcggac 
caagtcgggc 
catgat ccgg 
cattattgac 
gccgtatgtt 
caaggcagaa 
ggccaggctg 



180 
240 
30 0 
360 
420 
480 
540 
600 
660 
675 



<210> 680 
<211> 291 
<212> DNA 
<213> Homo sapiens 



<400> 680 

.ifggggatcc gggagaaatt tgcccactgc accgtgctaa ccattgcaca cagattgaac 60 

accattattg acagcgacaa gataatggtt ttagattcag gaagactgaa agaatatgat 120 

gagccgtatg ttttgctgca aaataaagag agcctatttt acaagatggt gcaacaactg 180 

ggcaaggcag aagccgctgc cctcactgaa acagcaaaac agagatgggg tttcaccatg 240 

ttggccaggc tggtctcaaa ctccctcgag caccaccacc accaccactg a 291 



^''210> 681 

<211> 1074 

<212> DNA 

<213> Homo sapiens 



<400> 681 

atgtcagcca ttgagagggt gtcagaggca atcgtcagca tccgaagaat ccagaccttt 60 

ttgctacttg atgagatatc acagcgcaac cgtcagctgc cgtcagatgg taaaaagatg 120 

gtgcatgtgc aggattttac tgctttttgg gataaggcat cagagacccc aactctacaa 180 

ggcctttcct ttactgtcag acctggcgaa ttgttagctg tggtcggccc cgtgggagca 240 

gqqaagtcat cactgttaag tgccgtgctc ggggaattgg ccccaagtca cgggctggtc 300 

agcgtgcatg gaagaattgc ctatgtgtct cagcagccct gggtgttctc gggaactctg 360 

aggagtaata ttttatttgg gaagaaatac gaaaaggaac gatatgaaaa agtcataaag 420 

gcttgtgctc tgaaaaagga tttacagctg ttggaggatg gtgatctgac tgtgatagga 480 

gatcggggaa ccacgctgag tggagggcag aaagcacggg taaaccttgc aagagcagtg 54 0 

tatcaagatg ctgacatcta tctcctggac gatcctctca gtgcagtaga tgcggaagtt 600 

agcagacact tgttcgaact gtgtatttgt caaattttgc atgagaagat cacaatttta 660 

gtgactcatc agttgcagta cctcaaagct gcaagtcaga ttctgatatt gaaagatggt 720 

aaaatggtgc agaaggggac ttacactgag ttcctaaaat ctggtataga ttttggctcc 780 

^ttttaaaga aggataatga ggaaagtgaa caacctccag ttccaggaac tcccacacta 840 

aggaatcgta ccttctcaga gtcttcggtt tggtctcaac aatcttctag accctccttg 900 

aaagatggtg ctctggagag ccaagataca gagaatgtcc cagttacact atcagaggag 960 

aaccgttctg aaggaaaagt tggttttcag gcctataaga attacttcag agctggtgct 1020 

cactggattg tcttcatttt ccttattctc gagcaccacc accaccacca ctga 1074 



<210> 682 
<211> 224 
<212> PRT 
<213> Homo sapiens 



<:4 00> 682 

Met His His His His His His Thr Ala Ala Ser Asp Asn Fhe Gin Leu 
5 10 15 



Ser Gin Gly Gly Gin Gly Phe Ala lie Pro lie Gly Gin Ala Met Ala 
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20 

He Ala Gly Gin He 
35 

Phe Leu Gly Leu Gly 
50 

Gin Arg Val Val Gly 
65 

Gly Asp Val He Thr 
85 

Ala Met Ala Asp Ala 
100 

Val Thr Trp Gin Thr 
115 

Leu Ala Glu Gly Pro 
130 

His Cys Thr Val Leu 
145 

Ser Asp Lys He Met 
165 

Glu Pro Tyr Val Leu 
180 

Val Gin Gin Leu Gly 
195 

Lys Gin Arg Trp Gly 
210 



25 

Lys Leu Pro Thr Val His 
40 

Val Val Asp Asn Asn Gly 
55 

Ser Ala Pro Ala Ala Ser 
70 75 

Ala Val Asp Gly Ala Pro 
90 

Leu Asn Gly His His Pro 
105 

Lys Ser Gly Gly Thr Arg 
120 

Pro Ala Glu Phe Met He 
135 

Thr He Ala His Arg Leu 
150 155 

Val Leu Asp Ser Gly Arg 
170 

Leu Gin Asn Lys Glu Ser 
185 

Lys Ala Glu Ala Ala Ala 
200 

Phe Thr Met Leu Ala Arg 
215 



30 

He Gly Pro Thr Ala 
45 

Asn Gly Ala Arg Val 
60 

Leu Gly He Ser Thr 
80 

He Asn Ser Ala Thr 
95 

Gly Asp Val He Ser 
110 

Thr Gly Asn Val Thr 
125 

Arg Glu Lys Phe Ala 
140 

Asn Thr He He Asp 
160 

Leu Lys Glu Tyr Asp 
175 

Leu Phe Tyr Lys Met 
190 

Leu Thr Glu Thr Ala 
205 

Leu Val Ser Asn Ser 
220 



<210> 683 
<211> 357 
<212> PRT 

<213> Homo sapiens 
<400> 683 

Met Ser Ala He Glu Arg Val Ser Glu Ala He Val Ser He Arg Arg 
5 10 15 

He Gin Thr Phe Leu Leu Leu Asp Glu He Ser Gin Arg Asn Arg Gin 
20 25 30 

Leu Pro Ser Asp Gly Lys Lys Met Val His Val Gin Asp Phe Thr Ala 
35 40 45 

Phe Trp Asp Lys Ala Ser Glu Thr Pro Thr Leu Gin Gly Leu Ser Phe 
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50 55 60 

Thi Val Arg Pro Gly Glu Leu Leu Ala Val Vai Gly Pro Val Gly Ala 
6b " "* 70 75 80 

Gly Lys Ser Ser Leu Leu Ser Ala Val Leu Gly Glu Leu Ala Pro Ser 

85 90 95 

His Gly Leu Val Ser Val His Gly Arg lie Ala Tyr Val Ser Gin Gin 
100 105 lie 

Pro Trp Val Phe Ser Gly Thr Leu Arg Ser Asn He Leu Phe Gly Lys 
115 120 125 

Lys Tyr Glu Lys Glu Arg Tyr Glu Lys Val He Lys Ala Cys Ala Leu 
130 135 140 

Lys Lys Asp Leu Gin Leu Leu Glu Asp Gly Asp Leu Thr Val He Gly 
145 150 155 160 

Asp Arg Gly Thr Thr Leu Ser Gly Gly Gin Lys Ala Arg Val Asn Leu 
165 170 175 

Ala Arg Ala Val Tyr Gin Asp Ala Asp He Tyr Leu Leu Asp Asp Pro 
180 185 190 

Leu Ser Ala Val Asp Ala Glu Val Ser Arg His Leu Phe Glu Leu Cys 
195 200 205 

He Cys Gin He Leu His Glu Lys He Thr He Leu Val Thr His Gin 
210 215 220 

Leu Gin Tyr Leu Lys Ala Ala Ser Gin He Leu He Leu Lys Asp Gly 
225 230 235 240 

Lys Met Val Gin Lys Gly Thr Tyr Thr Glu Phe Leu Lys Ser Gly He 
245 250 255 

Asp Phe Gly Ser Leu Leu Lys Lys Asp Asn Glu Glu Ser Glu Gin Pro 
260 265 270 

Pro Val Pro Gly Thr Pro Thr Leu Arg Asn Arg Thr Phe Ser Glu Ser 
275 280 285 

Ser Val Trp Ser Gin Gin Ser Ser Arg Pro Ser Leu Lys Asp Gly Ala 
290 295 300 

Leu Glu Ser Gin Asp Thr Glu Asn Val Pro Val Thr Leu Ser Glu Glu 
305 310 315 320 

Asn Arg Ser Glu Gly liys Val Gly Phe Gin Ala Tyr Lys Asn Tyr Phe 
325 330 335 

Arg Ala Gly Ala His Trp He Val Phe He Phe Leu He Leu Glu His 
340 345 350 

His His His His His 
355 
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<210> 684 
<211> 96 
<212> PRT 

<213> Homo sapiens 
<400> 684 

Met Gly lie Arg Glu Lys Phe Ala His Cys Thr Val Leu Thr He Ala 
5 10 15 

His Arg Leu Asn Thr lie He Asp Ser Asp Lys lie Met Val Leu Asp 
20 25 30 

Ser Gly Arg Leu Lys Glu Tyr Asp Glu Pro Tyr Val Leu Leu Gin Asn 
35 40 45 

Lys Glu Ser Leu Phe Tyr Lys Met Val Gin Gin Leu Gly Lys Ala Glu 
50 55 60 

Ala Ala Ala Leu Thr Glu Thr Ala Lys Gin Arg Trp Gly Phe Thr Met 

70 75 80 

Leu Ala Arg Leu Val Ser Asn Ser Leu Glu His His His His His His 
85 90 95 

<210> 685 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
<400> 685 

cgcccatggg gatccgggag aaatttgccc actgc 35 

<210> 686 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
<400> 686 

cgcctcgagg gagtttgaga ccagcctggc caaca 35 

<210> 687 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
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-■4-::: -87 

C| C H L Q Q d C C t. c 



caacc attgagaggg tgtcagag 



38 



<21G> 688 
■^2il> 34 
<212> DNA 

3 > Artificial Seauenc& 



<223> PCR primer 
<400> 686 

ccgctcgaga alaaggdaaa tgaagacaat ccag 



34 



<210> 689 
<211> 27 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> PCR prixer 
<400> 689 

gttgaattca tgcacgggcc ccaggtg 



27 



<210> 690 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR prixer 
<400> 690 

cccctcgagt cactatggtc tgcctcttga 



30 



<210> 691 
<211> 915 
<212> DNA 

<213> Homo sapiens 



<400> 691 

atgcatcacc 

cagggattcg 

accgtt cata 

ggcgcacgag 

ggcgacgtga 

gcgcttaacg 

ggcacgcgta 

ccccaggtgc 

ggggtgcgtc 

ccatgttccc 

aaagcagat g 

gcgagtgagg 



at caccat ca 
ccat t ccgat 
t cgggcctac 
t ccaacgcgt 
tcaccgcggt 
ggcatcat cc 
cagggaacgt 
tggcacgctg 
tggagggggt 
acagcct gag 
gccct t ggcc 
tt ggtggctg 



cacgg'ccgcg 
cgggcaggcg 
cgccttcctc 
ggtcgggagc 
cgacggcgct 
cggtgacgtc 
gacattggcc 
ct ccgagtgt 
ggaccggcca 
tggctgccac 
ctaccttttt 
tgcccccagc 



tccgataact 
atggcgatcg 
ggcttgggtg 
gctccggcgg 
ccgatcaact 
atctcggtga 
gagggacccc 
gcttgtcctg 
ccaaccttac 
ctgatggctg 
gttagaagaa 
tcctggcgcg 



t ccagctgtc 
cgggccagat 
ttgtcgacaa 
caagtctcgg 
cggccaccgc 
cctggcaaac 
cggccgaatt 
ccttggctgc 
ccagtcaagg 
atggagcaaa 
ctgatgttcc 
ccctcgcaga 



ccagggtggg 
caagcttccc 
caacggcaac 
catctccacc 
gatggcggac 
caagtcgggc 
catgcacggg 
cacctctgcg 
aagtggatgg 
ggccttagga 
atgtcctgca 
ggtgactggt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 
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tgctctttgg gccctcttgg ccttgcccag catgcacaag cctcagtgct actactgtgc 780 
tacaaatgga gccatatagg ggaaacgagc agccatctca ggagcaaggt gtatgctgcc 840 
tttgggggct ccagtccttg cctcaagggt cttatgtcac tgtgggcttc ttggttgtca 900 
3.gaggcagac catag 915 

<210> 692 
<211> 304 
<212> PRT 

<213> Homo sapiens 
<400> 692 

Met His His His His His His Thr Ala Ala Ser Asp Asn Phe Gin Leu 
5 10 15 

Ser Gin Gly Gly Gin Gly Phe Ala He Pro He Gly Gin Ala Met Ala 
20 25 30 

He Ala Gly Gin He Lys Leu Pro Thr Val His He Gly Pro Thr Ala 
35 40 45 

Phe Leu Gly Leu Gly Val Val Asp Asn Asn Gly Asn Gly Ala Arg Val 
50 55 60 

Gin Arg Val Val Gly Ser Ala Pro Ala Ala Ser Leu Gly He Ser Thr 
65 70 75 80 

Gly Asp Val He Thr Ala Val Asp Gly Ala Pro He Asn Ser Ala Thr 
85 90 95 

Ala Met Ala Asp Ala Leu Asn Gly His His Pro Gly Asp Val He Ser 
100 105 110 

fcci^ xxxx- u.rj^ w-Lai Mjyi3 cjcrj. i^±y crx y ± Fir Hzg xnr Lr^y >isn vax inr 

115 120 125 

Leu Ala Glu Gly Pro Pro Ala Glu Phe Met His Gly Pro Gin Val Leu 
130 135 140 

Ala Arg Cys Ser Glu Cys Ala Cys Pro Ala Leu Ala Ala Thr Ser Ala 
145 150 155 160 

Gly Val Arg Leu Glu Gly Val Asp Arg Pro Pro Thr Leu Pro Ser Gin 
165 170 175 

Gly Ser Gly Trp Pro Cys Ser His Ser Leu Ser Gly Cys His Leu Met 
180 185 190 

Ala Asp Gly Ala Lys Ala Leu Gly Lys Ala Asp Gly Pro Trp Pro Tyr 
195 200 205 

Leu Phe Val Arg Arg Thr Asp Val Pro Cys Pro Ala Ala Ser Glu Val 
210 215 220 

Gly Gly Cys Ala Pro Ser Ser Trp Arg Ala Leu Ala Glu Val Thr Gly 
225 230 235 240 

Cys Ser Leu Gly Pro Leu Gly Leu Ala Gin His Ala Gin Ala Ser Val 
245 250 255 
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Leu Leu Leu Cys Tyr Lys Trp Ser His lie Gly Glu Thr Ser Ser His 

260 265 270 

Leu Arg Ser Lys Val Tyr Ala Ala Phe Gly Gly Ser Ser Pro Cys Leu 

275 280 285 

Lys Gly Leu Met Ser Leu Trp Ala Ser Trp Leu Ser Arg Gly Arg Pro 

290 295 ' 300 



<210> 693 
<211^ 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
<400> 693 

cqaagtcacg tggaggccag cctc 24 



<210> 694 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
<4 00> 694 

cctqaccgaa ttcattaact ggcctggac 29 



<210> 695 
<211> 166 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> VARIANT 

<222> (1) . . . (166) 

<223> Xaa ~ Any 7\inino Acid 



<400> 695 

Met Gly His His His His His His 

1 5 

His Pro Glu Tyr Asn Arg Pro Leu 
20 

Lys Leu Asp Glu Ser Val Ser Glu 

35 40 

He Ala Ser Gin Cys Pro Thr Ala 

50 ^ 55 

Trp Gly Leu Leu Ala Asn Gly Arg 

65 70 

Asn Val Ser Val Val Ser Glu Glu 



Val Glu Ala Ser Leu Ser Val Arg 

10 15 

Leu Ala Asn Asp Leu Met Leu He 

25 30 

Ser Asp Thr He Arg Ser He Ser 
45 

Gly Asn Ser Cys Leu Val Ser Gly 
60 

Met Pro Thr Val Leu Gin Cys Val 

75 ^ 80 

Val Cys Ser Lys Leu Tyr Asp Pro 
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on 










yo 








ri J, i3 


trl/O 




1X1 en 


Jriie 


Cys Ala 


G-Ly 


Gly 


Gly 


Gin 


Xaa 


Gin 


Xaa 








100 








105 










110 






Asp 


Ser 


Cys 


Asn 


Gly 


Asp 


Ser 


Gly Gly 


Pro 


Leu 


lie 


Cys 


Asn 


Gly 


Tyr 






115 










120 








125 








Leu 


Gin 


Gly 


Leu 


Val 


Ser 


Phe 


Gly Lys 


Ala 


Pro 


Cys 


Gly 


Gin 


Val 


Gly 




130 










135 








140 










Val 


Pro 


Gly 


Val 


Tyr 


Thr 


Asn 


Leu Cys 


Lys 


Phe 


Thr 


Glu 


Trp 


He 


Glu 


145 










150 








155 










160 


Lys 


Thr 


Val 


Gin 


Ala 


Ser 





















165 



<210> 696 
<211> 504 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (504 ) 
<223> n = A, 7, C or G 



<400> 696 

atgggccatc 

aacagaccct 

tctgacacca 

ctcgtttctg 

aacgtgtcgg 

agcatgttct 

gggcccctga 

ggccaagttg 

aaaaccgtcc 



atca tcatca 
tgct cgctaa 
tccggagca t 
gctggggtct 
tggtgtctga 
gcgccggcgg 
t ctgcaacgg 
gcgtgccagg 
aggccagtta 



tcacgtggag 
cgacctcatg 
cagcattgct 
gctggcgaac 
gqaggtctgc 
agggcaanac 
gtacttgcag 
tgtctacacc 
atga 



gccagcctct 
ctcatcaagt 
tcgcagtgcc 
ggcagaatgc 
agtaagctct 
cagaangact 
ggccttgtgt 
aacctctgca 



ccgtacggca 
tggacgaatc 
ctaccgcggg 
ctaccgtgct 
atgacccgct 
cctgcaacgg 
ctttcggaaa 
aattcactga 



cccagagtac 
cgtgtccgag 
gaactcttgc 
gcagtgcgtg 
gtaccacccc 
tgactctggg 
agccccgtgt 
gtggatagag 



60 
120 
180 
240 
300 
360 
420 
480 
504 



<210> 697 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
<400> 697 

ctcagggttc cggagccgcg g 21 

<210> 698 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
<400> 698 

ctatagaatt cattaccaaa aagctgggct ccagc 35 



<210> 699 
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